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Description 

Background of the Invention 

5 [0001 1 This invention relates to prefix matching in database searches. 

[0002] A database associates sets of strings, or keys, with stored Information. Databases are frequently used to 
search for particular information associated vvith a given input string or key. 

[0003] Some applications also require the retrieved infbnnation to be associated with the best matching prefix, if 
any, of the input string. For example, if the string 'CART is the input stiing to a database, and the database holds intor- 
10 mation assodated with the strings 'C, 'CA'. and 'CARL', tiie best matching prefix to 'CART' is the string 'CA'. and 
the infbnnation associated with 'CA* should be returned. Note that 'C is also a prefix of 'CART*, but 'CA' is a better 
(i.e. longer) prefix than "C". 

[0004] Best matching prefix searching is typically performed by a database having a hierarchical, tree-like structure. 
This type of database is often called a trie. A trie database allows botti exact matching (i.e. searching for a string that is 
IS exactly equal to the input string) as well as best prefix matching. Refening to Fig. 1 . a trie consists of a number of nodes 
32 • 

[0005] each of which contain pdnters to other nodes. Each node has an array of n pointers, one pointer corre- 
sponding to each of n possible characters that can occur in a character of the input string. The ti-ie also has a single 
node 33 called ttie root, at which ttie search begins. 

20 [0006] To took up a string. e.g. 'CAO', a search starts at the root node of the trie and uses the first character 'C to 
index into the array of pointers at tine root node. The 'C" pointer 34 will point to a section of the trie that contains infor- 
mation for all strings that begin with 'C. The search travels to this new node, and uses the next character in the input 
string. "A", to index Into the array of stored pointers. The "A" pointer 35 yields the root of another section of the trie that 
contains information for all strings that of all strings that begin with "CA". Finally, the search uses the last character "D' 

25 to index into the array to obtain the actual entry corresponding to 'CAD". 

[0007] The storage requirement of a ti-ie can be calculated, and Is roughly proportional to the product of: (1) the 
number of entries in the database, (2) the number of distinct characters, (3) the average number of characters in a word, 
and (4) the storage size of a pointer. Thus for a 50,000 enti-y directory database having (2) 26 possfole charactws, (3) 
up to 20 characters per entry, and (4) 4 byte pointers, the amount of storage required is around 2K bytes per entry, or 

30 100 Mbytes. 

[0008] Despite this storage requirement, the trie is attractive for fast look up and prefix matching. Some useful appli- 
cations include directory look ups in a telephone context, on-line dictionaries, spelling checkers, and looking up social 
security numbers. 

[0009] A computer network consists of a number of computers ttiat are connected together by devices called rout- 
as ers, such that any computer can send messages, called packets, to any ottier computer. By analogy, ttie routers are 
post offices, and the packets correspond to letters. Each packet carries a destination address, and each router com- 
putes the best path towards that destination address. Each router along ttiis patti is responsble for fbnMarding' ttie 
packet to the next router on ttie path. This tonvarding process continues until the packet reaches its destination. When 
a packet arrives at a router, the router searches for the destination address in a tonvarding database. The forwarding 
w database consists of a list of destination addresses and ttie next router in ttie patii toward each such address. 

[0010] Since ttie postal system is too large, it is impossible for each post office to store a database containing 
enti-ies tor every address in ttie world. Instead, to route a letter to WHITEHALL-LONDON-ENGLAND, it Is first sent to 
the destination country (England), ttien to the city (London) and finally to ttie street address (Whitehall) in ttie destina- 
tion city. Thus we could describe ttie postal system addresses as having three levels of hierarchy: Level 0 is the street 
45 address. Level 1 is ttie city, and Level 2 the country. For the same reason, destination addresses in very large computer 
networks are also divided hierarchk:ally and have several levels of hierarchy 

[001 1 ] One mettiod for consti-ucting very large networks ttiat is descritjed by the International Standards Organiza- 
tion (ISO) Routing Standard. This is soon to be a worldwide standard which will be used to buiU large global networks. 
According to the ISO standard, each router does not store routing information for every possible address in the network. 

so Rather, it stores routing information for partial addresses. 

[0012] For example, a router might store ttie best ways to fonward a packet to ttie partial addresses DEC-READ- 
INQ-ENGLAND, ENGLAND, and LONDON-ENGLAND. Suppose ttie router now gets a packet addressed to WHITE- 
HALL-LONDON-ENGLAND. The ISO Standard states ttiat the router should send ttie packet to ttie best matehing 
partial address it has in its database. Thus, in the above example, since ttie router knows how to forward packets to 

55 LONDON-ENGLAND, ttte packet should be sent there. In this scheme, each time a packet is fonwarded It gets ctoser to 
its destination. 

(001 3] The ISO Routing Standard for worldwide networie specifies that each router in ttie networit maintain a data- 
base of partial addresses. When a packet arrives at ttie router, the router must search through ttie database and 
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retrieve the entry corresponding to ttie destination address in the packet or. falling that, retrieve the entry con-esponding 
to the best matching prefix of the destination address. 

[0014] A ISO routing standard of particular interest is the Open Systems Interconnection (OSI) standards, such as 
ISO 8348 Addendum 2 (ISO 8348/AD2), as promulgated by the International Organization for Standareiization. Under 

5 this standard, the administration of sub-spaces of an OSI address has been delegated to various internationally recog- 
nized organizations. Each of these organizations has been allocated a unique initial address octet (typically eight bits) 
indicating the delegated administration. The individual organizations are responsible for allocating further portions of 
the address, as identified by unique initial parts of a length specific to the organization, for administration and allocation 
by other organizations. This process can iterate many times, but guarantees that specific assigned node addresses are 

10 globally unique. 

[001 5] An OSI network address (NSAP) fbrmat is shown in Rg. 2A. It Includes an initial domain part IDP 60 and a 
domain specific part DSP 70. The format and length of the IDP 60 is standardized. It consists of two parts, the API 62 
(authority and format identifier) and the 101 64 (initial domain identifia-). These elements each require a specified 
number of bits, counted by octets (eight bits) or semi-octets (four bits). The digits in the API and IDI are binary coded 
IS decimal digits. Each decimal digit is represented by a semi-octet value in the range of 0000 (decimal 0) to 1 001 (decimal 
9). 

[0016} The API 62 is standardized as two semi-octets (i-e. two binary coded decimal digits) long and is used to 
specify the authority responsible tbr allocating IDI values, and for defining the fomiat of the IDI. The IDI 64 identifies the 
subdomain from which DSP values are allocated, and the authority responsible for allocating the values. Depending 

so upon the IDI format, the actual number of digits in the IDI field 64 may be fewer than the number of semi-octets which 
are allocated to the IDI field. The Preferred Binary Encoding specified by ISO 8348/AD2 specifies that the IDI be pad- 
ded with leading digits, if necessary, to obtain the maximum IDP lerigth specified by the API. Thus the IDI field may con- 
tain some digits 66 which convey address information, and other fill digits 65 which do not convey information. The 
useful IDI digits 66 are right-justified in the IDI field, and the remainder of the IDI field contains the fill digits 65. The value 

2S of the API can be used to determine the I DP length and to locate the useful IDI digits 66, as will be fully discussed below. 
[0017] IDI formats specified in the ISO 8348/AI32 standard include those promulgated by a nunrd3er of different 
authorities, including the following: 

X.121 UPtMc data network numbering) 
30 ISO DCC (Geographic address assignment under ISO control) 
R69 (Telex numtwring) 
E.163 (Telephone numbering) 
E.I 64 (ISDN numbering) 

ISO IGD (Non-geographic address assignment under ISO control) 
35 Local (IDI is null; address is not necessarily unique). 

[001 8] The IDI 64 identifies the authority which adnvnisters the DSP. The specific fomnat of the DSP 70, except for 
its maxinum length, is not presently prescribed by ISO but rather is left to the Indicated authority. The DSP may use a 
binary coded decimal syntax similar to the IDP, or may use a straight binary syntax. Where the DSP uses a binary syn- 
40 tax. the DSP value is represented directly as binary octets. Where the DSP uses a decimal syntax, each decimal digit 
is represented by a semi-octet in the range of 0000 to 1001 (as in the IDP). In the latter case, where necessary, the 
semi-octet value of 1 1 11 is used as a pad after the last semi-octet of the DSP to round the entire address length to an 
integral number of octets. 

[0019] Figures 2B and 2C are tables indicating the API values and maximum lengths required for IDP. DSP and 
4S entire NSAP address con-esponding to each IDI format. (Note that in NSAP addresses in ISO 8348/AD2 format, the IDI 
is padded to the maximum length.) Where two values are given for the API, the first identifies an IDI which is padded to 
maximum length with zero (0000) leading digits, while the second identifies an IDI which is padded with non-zero lead- 
ing digits (the non-zero padding digits must have the value 0001). Non-zero leading digits are used to alleviate confu- 
sion when the first digit of the actual IDI value is equal to 0000. Therefore, if non-zero padding digits are used in the IDI. 
50 the first zero digit in the IDI must be the first non-fill digit. Pig. 2B applies to cases where the DSP syntax is binary, 
whereas Fig. 20 applies to cases where A is decimal. 

[0020] As an example, a two semi-octet BCD API value of thirty-six indicates that: (1) the destination system is 
using an X.121 put)llc network address, (2) the IDI 64 consists of up to fourteen significant decimal digits idemifying a 
subdomain authority, and (3) the DSP 70 semi-octets, if present, will represent a destination device in Binary Coded 
55 Decimal syntax. 

[0021] In the current version of the DECnet Phase V addresses tor the Digital Netw rk Architecture (DNA), as 
promulgated by Digital Equipment Corporation, Maynard, Massachusetts, for example, the DSP 70 has binary syntax, 
and the last nine octets of the NSAP (the last seven of which must be in the DSP) are partitioned into several fields as 
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shown in Fig. 2A. (Those fields in Fig. 2A which are specific to DNA are marked with an asterisk (')) 
[0022] IjOC-AREA 72 Is a field defined for backward conpatibility with former versions of DNA and for possible 
future enhancements. The LOC-AREA 72 is defined as the first two octets of the last nine octets of the NSAP. 
[0023] Level-1 ID 74 is a six octet field which uniquely identifies the destination system within a DECnet area. Cor- 

5 rect operation of the DNA Network Routing Layer requires only that the ID 74 field be unique within a DECnet area 
(except for Level-2 routers, where the Let^el-I ID of the Level-2 router is typically unique within the whole private net- 
work). However, the ID field is usually chosen from the IEEE 802 address space, in which case it is guaranteed to be 
globally unique. If an 802 address is used, it may correspond to the actual Data Link address of the node on an 802 
LAN, but this oorrespondence is not assumed or required by the routing algorithms. 

10 [0024] SEL 76 is a one octet field at the end of a DECNET Phase V address. SEL acts as a selector for the module 
which is to receive the packet once it reaches its destination. The concatenation of the IDP 60 and the leading portion 
of the DSP (i.e.. if it exists, the portion of the DSP preceding the last nine octets) is called the PRE-LOC-AREA 80. The 
concatenation of the PRE-LOC-AREA and LOC-AREA is known as the Area Address 90. (Thus the Area Address is all 
but the last seven bytes of the NSAP). If a packet has an Area Address 90 which exactiy matches that of the local area, 

IS then the packet's destination is kxa\ to the area and is routed by Level-1 routing, using the Lei/el-1 ID field 74. Other- 
wise.it is routed by Le)/el-2 routing. Level-2 routing acts on prefix portions of the area address, directing the packet to 
that area whose area address has the maximum exact match with the packet address. 

[0025] Other, non-DNA nodes need not follow DNA addressing conventions or requirements. However, routers 
designed for DNA address syntax will interoperate with non-DNA nodes and non-DNA networks if certain requirements 

20 are met. There are several possible modes of interoperation: 

[0026] in one mode, a non-DNA End System is operating in the DNA Level-2 network, and an adjacent Level-2 
router is manually configured to fonward packets to the End System via a DNA "Reachable Address Entry". The only 
requirement of the address of the non-DNA End System is that every prefix of the End System's address, formed 
removing at most 14 trailing semi-octets, must be distinct from all Area Addresses in the Level-2 network. 

2S [0027] As an End System in a particular DNA area, the address of the non-DNA node is subiecit to the restriction 
that the leading octets, prior to the last 7 octets, must be equal to the Area Address of the area in which the node 
resides. Additionally, the leading 6 octets of the last seven octets must constitute a unique Level-1 ID within the area. 
Configuration of the adjacent router occurs manually, or, automatically via the ES/IS (ISO 9542) protocol. 
[0028] Finally, a DNA network will interoperate with autonomous networks of non-DNA nodes via Reachable 

30 Addresses, using address prefixes. 

[0029] Routing in a network is based on a fbnwarding database. In a fbrwarding database, each listed destination 
address Is cross-referenced with the next link, and the address on that link, of the routing path a packet should take to 
reach its destination. 

[0030] The database may be divided into two parts : (i) a part which maps network addresses onto internal indices, 
35 and (iq a part which maps the internal indices onto sets of links and link address elements. 

[0031] A network router obtains the destination address information from the header of a received packet, accesses 
the database to determine the best next link through which to route the packet and the data link address on that link, 
and fonvards the packet accordingly. 

[0032] Known database formats affect the rate at which packets are fonwarded, and the storage requirements of the 
40 database may be large. 

[0033] ACM Transactions on Database Systems 14 (1989), pp 41—74; R Ramesh et al: "Variable-Depth Trie Index 
Optimization: Theory and Experimental Results" describes a technique for reducing the storage requirements of a trie 
database by removing sub-tries from the datatiase, and using a non-trie search technique to search within the portion 
of the database corresponding to the removed sub-tries. 

45 

SMmmary of the Inventipn 

[0034] According to the invention, a routing database requires less space than prior art databases for identical func- 
tion. In addition, routing information is located more quickly using less expensive hardware. 
so [0035] The present invention provides method and apparatus for conducting searches along a path length of 
reduced length according to the appended independent claims. 

[0036] In one aspect, a node which would othenwise occur between a previous and a following node in the search 
path is eliminated, and information is stored as to whether, had said eliminated node been present, the search would 
have proceeded to the following node. During the search, a search argument is compared witti the stored information, 
55 and the search effectively progresses from the previous node directiy to the following node if the comparison is positive. 
[0037] Pr fen-ed embodiments indude the following features. Some nodes provkle result values for the search. A 
node is eliminated only if its presence would not affect the result value for the search. The infbrmation to be compared 
is stored in tiie following node. The search has first and second modes, thef irst mode including processing nodes along 
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the search path. The eliminated node Is one that if present, would either cause the search to progress to the following 
node, or cause the search to enter the second mode. The search argument comprises a series of search segments, 
some values of segments of said argument corresponding to nodes along the search path, some other values of the 
segments relating to the second mode of the search. The stored information is a sequence of said search segments. 

5 Intficators are associated with nodes, each indicator indicating the segments corresponding to the second mode. The 
search path is searched by processing successive search segments, the processing including inspecting the indicator 
associated with each node, and proceeding to the second search mode if the indicator indicates that the segment 
relates to the second mode. The second mode of the search comprises terminating said search. The search argument 
comprises a system address in a network. 

10 [0038] Other advantages and features will become apparem from the following description of the preferred embod- 
iment and from the claims. 

Description of the Prefen-ed Embodiment 

IS [0039] We first briefly describe the drawings. 

Fig. 1 is a diagram of a trie database. 

Fig. 2A is a diagram of an OSI network address format 

Rg& 2B and 2C are tables of IDI formats. 
so Fig. 3 is a block diagram of a router having a recognition engine 

Rg. 4 is a diagram of an uncompressed routing database structure. 

Fig. 5 is a diagram of a pointer-compressed routing database structure. 

Fig. 5A is a diagram of a pointer-conpressed node structure. 

Rg. 6 is a diagram of a path-compressed routing database structure. 
2s Fig. 7 is a diagram of a partial database structure illustrating the provisions for IDI, DSP and Level-1 processing. 

Rg. 8 is a diagram of the datapaths of a recognition engine. 

Rg. 9 is a table illustrating the locations of node data in the memories of Rg. 8. 

Structure and Operation 

30 

[0040] Tries are an appropriate cancfidate for maintaining tiie a database of partial addresses in ISO routers. Tries 
support a best matching pr^ix search of a destination address easily In addition, tries are appropriate for searching 
large and variable length strings such as ISO partial addresses. However, the storage requirements of tries can make 
the use of tries in practical routers infeasible. Also, to apply a trie to ISO addresses, ttie trie search mettiod must be 
35 adapted to the particular characteristics of ISO addresses, such as the variable-length fields and fill digits disojssed 
above. 

[0041] The invention reduces the storage requirements of a tile database, and also adapts a ti-ie search to ISO 
addresses. The storage reductions accomplished by the invention are detailed in Appendix C. 
[0042] The methods for reducing trie storage requirements are applicable to tiles in a wide variety of contexts out- 
40 side of the prefenred embodiment; the methods for adapting trie searches to ISO addresses can also be useful in other 
contexts. 

[0043] Refen-ing to Fig. 3, in the primary operation of a router 10 a packet 11 is received at a receive unit 12. 
Receive unit 1 2 presents the packet to a forwarding engine 1 4. The tonwarding engine 1 4 processes the packet header, 
extracting the destination network address and transferring it to a recognition engine 20. A logic unit 22 provides ttie 

45 logic signals used by fonwarding engine 14 to interface with recognitfon engine 20. Recognition engine 20 uses the des- 
tination address to recover an indec from a database stored in memory 50. This index is returned to the fbnvaiding 
engine 14, which uses the value to access a forwarding databiise (not shown). The fonwarding database consists of 
sets of link references and link addresses. Forwarding engine 14 ttien uses the link reference to direct packet 1 1 to one 
of several transmit units 26A through 26D (four being shown for illusbrative purposes). Transmit units 26A through 26D 

so are responsible for queuing packets and sending ttiem via the network communications links 1 6 to other routers or des- 
tination systems. 

[0044] In address recognition engine 20. a fetch unit 30 receives the packet destination address from tonvarding 
engine 14 and presents the address, or specific fragments of the address, to a search unit 40. Search unit 40 uses ttie 
address or address fragments presented by fetch unit 30 to directiy access memory SO. Intermediate results fetched 
55 from memory 50 are processed by search unit 40 and are used, in conjunction with the address presented by fetch unit 
30. to access furttier intermediate results and ultimately ttie final result The final result is returned by search urit 40 to 
forwarding engine 14 which uses the result as an index into ttie forwarding database (not shown). 
[0045] In ottier nmdes of operation, the database may be used by a management engine 18. (See Appendix B) 
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Management engine 18 is responsilDle for creating and maintaining the database stored in memory 50. Management 
engine 18 receives information about the current network connectivity on a continuous tiasis. through paths which are 
not shown. Management engine 18 presents address fragments, generated from network connectivity information, to 
search unit 40. The result of this search is returned to management engine 1 8, which uses th result to learn about the 

5 current information in the database. If it is necessary to update the database, so that it con-ectly represents the current 
network connectivity, management engine 18 directly accesses memory 50 to effect the changes. 
[0046] As mentioned in the background to the invention, router 10 must be capable of performing Level-1 and 
Level-2 routing. Level-1 routing requires that an exact match of the network destination address presented by fetch unit 
30 be located in the database. There are many known database structures that allow an exact match to be realized. 

10 Level-2 routing, however, requires that if an exact match cannot be found, then the entry in the database which is the 
longest prefix of the network destination address should be located. This requirement linuts the number of different 
database structures that may be used. 

[0047] In the invention, the database stored in memory 50 is a particular type of tree structure known as a TRIE. 
Fig. 4 is an example of a fragment of such a TRIE structured database. Each node 110 of the TRIE may contain an 
IS array of sixteen pointers 1 05 to other nodes 1 10. A transition from the root of the TRIE to other nodes 1 10 is made by 
fragmenting the search argument (e.g a network destination address) into a sequence of segments (e.g. semi-octets); 
at each node in the TRIE, the next semi-octet in the sequence, having a value from zero through fifteen, is used to 
select one of the sixteen pointers. 

[0048] Referring to Fig. 3, the Current Node Register 41 points in turn to the nodes of the TRIE that are traversed 
so during a search. When a search is initiated, the Current Node Register 41 is set to point at the root of the TRIE. In the 
fragmentation of the search argument into semi-octets, the semi-octets formed correspond to the constituent digits of 
the NSAP discussed earlier. 

[0049] Referring to Fig. 4, nodes 1 10 with pointers are called parent nodes; the targets of the pointers are child 
nodes. Nodes in the trie which do not contain pointers to other nodes (e.g nodes 110-8. 1 10-9 and 1 10-1 1) are called 
25 "terminal nodes'. When the search reaches a terminal node, the search terminates. The search will also ternvnate if 
the pointer 105 selected by the next semi-octet points to a NIL node 1 15. (Although several nodes of Fig. 3 contain nil 
result values, for clarity, many other nil nodes have been omitted.) 

[0050] If a terminal node is reached by using all of the semi-octets in the seardi argument, then an exact match has 
been located in the database, and the terminal node holds a result value 120 which corresponds to the search argu- 

30 ment. If, however, a NIL node 115 is reached, or aternvnal node (e.g 110-8) is reached before all of the sequence of 
semi-octets has been used, then an exact match of the search object is not contained in the datat>ase. 
[0051 ] It is possible that a prefix of the search object is contained in the database; if this is the case, then at least 
one of the nodes traversed during the search will hold a resutt value 120. If no traversed node contains a result value 
(e.g. a search ending at node 1 10-1 3) then no such prefix exists in the database. If multiple nodes holcSng a result value 

35 are traversed, then multiple prefixes of the search object are contained in the database: the first such node traversed 
h Ids a resutt corresponding to the shortest prefix, while the last such node traversed holds a result corresponding to 
the longest prefix. The result corresponding to the longest prefix is returned as the result of the search. 
[0052] In many cases, the result value 1 20 for a child node and its parent node woukl be identical. In these cases 
(e.g., nodes 1 10-5 and 110-7), the child node does not hold a result value 120. If the search temiinates at a child node 

40 which does not contain a result value 120, the result value 120 of the most-recently-traversed prior node which has a 
result value is returned. In this way, a prefix stored in the database has its corresponding resutt value entered at only 
one node, simplifying maintenance. 

[0053] The preceding outlines how a TRIE structured database can be used to enat^e a search process to locate 
an exact match or the best-prefix match of a given search argument. Two forms of database compression are used in 
45 prefen^ed embodiments of the invention to reduce the amount of memory required to support the TRIE structure. These 
are known as pointer compression and path compression. 

[0054] Referring to Fig. 5, pointer compression is achieved by eliminating all of the nil nodes and all of the pointers 
to nil nodes from the trie. This is done by associating a pointer bit mask 1 15 with each parent node in the trie. The bit 
mask indicates which of the child nodes are nil, i.e. which of the pointers 105 point to nil nodes. Each bit in the mask 

so corresponds to one of the parent node's pointers, and is set if that pointer's target node is non-nil ; otherwise it is cleared. 
[0055] For example, the root node 1 00 is associated with bit mask 115-0. Two non-nil chikJ nodes 110-1,110-2 exist 
below the root node 100. Thus the root node bit mask 115-0 contains two T bits, one for each of the non-nil child 
nodes. All other bits of the mask are '0", indicating that all other child nodes of the root node are nil nodes. 
[0056] By comparing Fig. 5 to Rg. 4 it can be seen that the location of these two '1 ' bits correspond exactly to the 

55 location of the two pointers. In operation, before moving to a child node, the search unit 40 checks the parent bit mask 
1 15 to determine that the new node is non-nil. If the new node is nil, then the search immediately terminates without 
moving to the new node. 

[0057] Thus, refemng to Rg. 5A all nil nodes (and the pointers which indicate nil nodes) are eliminated from the trie. 
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saving storage space. Since it would be necessary to store only one nil node tor the entire TRIE, the memory savings 
are a result of the elimination of the pointers to the nil node or nodes. With pointer compression, the memory require- 
ments for a TRIE structured database may be reduced, but the calculation of current node addresses is more complex. 
To determine the address of new nodes, the search unit 40 must reference th parent's bit mask 115 to determine the 
5 number and distribution of pointers 1 05. If the bit mask indicates that the pointer corresponding to the next semi-octet 
in sequence is present, the bit mask is used to determine the location of the required pointer 1 1S in the list of pointers 
stored in the parent node. 

[0058] To introduce path compression, we first note that pointer compression has elinvnated nil nodes and pointers 
to nil nodes, but has not reduced the path length (i.e. the number of nodes) from the root mde to any given result node. 

10 Path compression is used to achieve such reductions. 

[0059] Path compression eliminates each node in the database that has: (1 ) only one child node, and (2) no result 
value or the same result value as its parent node. Node 110-7 of Fig. 5 meets these requirements. Referring to Rg. 6, 
after path compression, node 1 10-7 has been eliminated, and node 1 10-1 1 is drectly below node 110-2. 
[0060] In the preferred embodiment of the invention, the semi-octets corresponding to the (one or more) eliminated 

IS nodes is stored as a path-compression digit string 125. (In Fig. 6, the string is just one digit in length.) In operation, 
search unit 40 compares each digit of the path compression digit string 125 against successive semi-octets of the 
search object. Ail comparisons must indicate equality in order to proceed to the next node. If any of the comparisons 
fail, the search terminates at the node traversed immediately prior to the path compression digit string 125. By imple- 
menting path coopression in this way, the TRIE database may be used to locate the best prefix for a search object This 

20 would not be possible if the skipped semi-octets were not processed. 

[0061] In preferred embodiments, the path digit string is stored at the child node. If a string is present at a child 
node, the string must successfully be matched against subsequent semi-octets of the search obietA in order to arrive at 
the child node, if the string match fails, the failure is treated as though a nil pointer were selected at the parent node. 
[0062] This process is indicated diagrammatically in Rg. 6 by the interjection of the path digit string 125 between 

25 nodes 1 10-2 and 1 10-11 . The existence of a path dig'it string in a node is indicated by a flag bit at that node, if this flag 
is set. the node has an associated path digit string; if not set, the node contains no such string. As can be seen in Rg. 
6. flag bits have been included with each of the nodes 110. 

[0063] Thus, pointer compression and path compression result in reduced memory requirements for a TRIE struc- 
tured database while allowing exactly the same functionality as an uncompressed database. 

30 [0064] The optimized TRIE structure described above requires some enhancements to allow it to properly process 
an OSI network destination address (NSAP) in the context of DECNET Phase-V routing. The mechanisms described 
so far provide for searches capable of finding exact matches and of finding maximal length prefix matehes of the input 
address. These mechanisms must be enhanced in order to: (i) allow stored address prefixes which are shorter than the 
IDP to match addresses whose leading significant ID I digits are identical to the significant IDI digits of the stored prefix; 

35 and (ii) allow recognition of area addresses, prompting a transition to a Level-1 database (since network addresses are 
variable in length, it is possible tor a stored area address to match the leading semi-octets of an input address without 
the input address meeting the requirement that it have precisely fourteen semi-octets remaining after the matching area 
address). 

[0065] As an example to illustrate the requirement for the first enhancement, consider a prefix 37-123 stored in the 
40 database. In this example, the AFI 'Z7" indicates that the IDP length is 16 digits (i.e. the IDI length is 14 digits). The 
digits that follow the hyphen, namely '123' are the leading significant digits of the IDI. Every address presented, for 
example by fetcti unit 30. whose AFI is equal to "Sr and whose leading significant IDI digits are equal to '123' should 
traverse a node in the TRIE that indicates a recognition of the prefix 37-123. Consider the fbltowing three oidresses. as 
presented by fetoh unit 30: 

45 

(i) 370000000001 23456789abodef 

(ii) 37000000000001234S6789ab 

(iii) 37000000000000123456789abcdef 

50 [0066] The first address has significant IDI digits equal to '12345". The second address has significant IDI digits 
equal to "123". The third address has significant IDI digits equal to "12". Thus the first and second addresses contain 
the prefix 37-123 but the third does not. Because there can be any number of padding digits in the IDI (between zero 
and eleven) in an address containing the above prefix 37-123, an unmodified address recognition trie would have twelve 
different branches for the prefix 37-123. each representing a different number of pad digits (in the range 0 through 1 1) 

55 and each leading to a distinct node labelled with the result corresponding to prefix 37-123. However, this structure is 
botti inelegant and mem ryintensiv . 

[0067] A prefen'ed way to store the prefix 37-1 23 is t treat all IDI pad digits as discardable. Then the node storing 
the result corresponding to the prefix 37-123 is reached after the semi-octet sequence 3.7.1,2,3. In preferred embodi- 
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ments. pad digits are easily discarded, without changing the basic operation of the TRIE. Pad digits are discarded by 
arranging the TRIE such that ID) pad digits select pointers 105 that point back to the parent node, rather than to a dif- 
ferent child node. (As a consequence of this seK-referencing. a node which corresponds to the first digit of the IDI can- 
not have an associated path compression digit string). 

5 [0068] Without further nxidification. the third address in the example list above, once the IDI padding dgits are dis- 
carded, will also travel along the path 3.7. 1 ,2,3, and felsely indicate that it contains the prefix 37-1 23. To overcome this, 
two mechanisms are added to the basic TRIE : fi) a counter, the Remaining IDI Length Counter 43 (Rg. 3) which main- 
tains a count of the remaining IDI digits of the input search address: and (ii) a seventeenth pointer, called the DSP 
pointer, which points from a parent node representing a digit of the IDI to a child node representing the first digit of the 

10 DSR The seventeenth pointer is accessed when the IDI Length Counter is deaemented to zero. 

[0069] In operation, during a search of a network destination address, nodes in the TRIE reached after processing 
the second semi-octet of the address store the corresponding IDI lengths. That an IDI length is stored is indicated by 
an additional flag bit. Search unit 40 recognizes that the flag is set. reads the IDI length stored at the node and transfers 
it to the IDI Length Counter 43. Each time that a semi-octet is processed by search unit 40 (including the padding digits 

IS of the IDI). the IDI Length Counter 42 is decremented by one. When the IDI length is decremented to zero, the entire 
IDI has been processed. At this point, the seventeenth (i.e. DSP) pointer will be selected at the current node. Note that 
if the AFI value is 48 or 49. then there is no IDI; thus nodes in the database conesponding to these AFI values must 
provide an indication that the IDI Length is zero and must also provide a seventeenth pointer for immediate transition 
into the DSP domain of the database. 

20 [0070] Another example illustrates the need for a second ertfiancement of the basic TRIE, relating to recognitkxi of 
area addressea Consider that a router is operating in a OECNET area defined by the area address 
471 23400000000abod. By refemng to Figs. 2A and 2B, distinct fields of this area address can be recognized, as shown 



[0071] Next, consider that the following three network destination addresses are presented to search unit 40 by 
fetch unit 30 (having been extracted from packets traversing the network): 



(0 47123400000000abcd0123456789ab00 
35 (ii) 47123400000000abcd0123456ff 

(ill) 47123400000000abcd01234567ff89ab00 



[0072] The leading digits of the first address exactly match the area address of the area In which the router resides. 
The remaining digits can be interpreted as two fields : (a) a 12 semi-octet field 0123456789ab which is the DECNET 
40 Level-1 ID part of the address and (b) a two semi-octet field 00 which is the DECNET SEL field. Thus the packet from 
which the first address was extracted should be regarded as destined for an End System vinthin this area, and should 
be routed by DECNET Level-1 routing. An exact match for the Level-1 ID part must be found in the database so that the 
packet may be forwarded to the correct End System. 

[0073] The leading digits of the second address also exactly match the area address of the area in which the router 
45 resides. When the remaining digits are examined, there are nine. Thus, the remaining digits cannot be interpreted as a 
Level-1 ID part plus a SEL part; accordingly, the packet containing the specified destination ^ress is not destined for 
an End System within the area in which the router resides. The incidental match between the leading digits of the des- 
tination address and the area address must be discounted: the longest prefix of the entire address should be sought In 
the Level-2 datatiase. and the packet routed by Level-2 routing. 
50 [0074] Similarly, the leading digits of the third example address exactly match the area address. Here again, the 
match must be discounted because the remaining digit string would be too long to interpret as a Level-1 ID part plus a 
SEL part Again, the longest prefix of the entire address should be sought In the Level-2 database, and the packet 
r uted by Level-2 routing. 

[0075] When a search 1 f the above three addresses is eHected in a TRIE database, it is clear that they will all follow 
55 the same path until and beyond the node in the database corresponding to a match with the area address. As explained 
abov , it is necessary for the first address to follow a different path once a match with th area address has occupied - 
at this point a branch sfiould be made to a sutstrie which is a logk»lly separate database holding all Level-1 addresses. 
[0076] To allow for branches from the Level-2 database into the Level-1 database, a further, eighteenth, pointer is 
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added to each TRIE node. In order to use the eighteenth, or 'Level-1", pointer, two adcfitional mechanisms are added 
to the basic TRIE: (I) a counter and comparator in a Current Remaining Length register (CRL 42 in Fig. 3). which indi- 
cate that there are exactly 14 semi-octets of the search object remaining; and (ii) an additional flag bit stored at each 
node, which Is set only at the node or nodes in the TRIE that are reached after an exact match against the area address 
5 (i.e. the area address of the area in which the router resides). This flag bit will be refened to as the 'Level-1 transition 
possible" bit. 

[0077] The CRL register 42 Is loaded with the length of the entire search object (i.e. number of constituent semi- 
octets) when it is first presented by the fonvarding engine 14 to the fetch unit 30. Although not part of the address per 
se, the length of the address is included In the packet header as specified in ISO 8348/A02. The CRL register 42 is 

10 decremented by one each time that a semi-octet is processed by search unit 40. 

[0078] Only when the Level-1 transition possible bit is set aod the CRL register indicates that there are exactly fbur- 
teen semi-octets remaining, will the eighteenth pointer be selected. The child node of the eighteenth pointer will be the 
root node of the Level-1 database. Note that the neoct semi-octet from the search otqect will not be consumed until after 
ttie transition into the Level-1 database. 

IS [0079] In operation, if a transition into the Level-1 datat>ase takes place during a search of a network destination 
address, then the packet containing the address is destined for an End System within the area in which the router 
resides. As explained above, an exact match must be found in the Level-1 database for the Level-1 ID part of the des- 
tination address. That is. any former test prefix" results that may have been acquired during the initial traversal of the 
Levet-2 database are irrelevant once a transition into the Level-1 database has taken place. If an exact match cannot 

20 be found in the Level-1 database, the packet must be discarded since its destination address does not exist. Results 
returned from a search in the Level-1 database must therefore conespond to exact matches; otherwise, a nil result 
should be returned. 

[0080] Refen-ing to Fig. 7. four nodes of a partial database are illustrated, and will be used as an example of the 
progression of a search through a database in accordance with the invention. In the example, path compression has 

25 been used, but pointer compression has not. 

[0081 ] The search progresses as follows. An address search an^ives at node 1 1 0-A. The Path String and I D I flags 
are checked upon arrival. Since both flags are not set ("Q") in node 1 10-A, node 1 10-A does not require special process- 
ing. Therefore, the search uses the next semi-octet to index the array of pointers and proceeds to a child node. In the 
example, the fetched senti-octet has a value of 7, so the search unit 40 uses pointer sevwi to move to node 1 1 0-B. 

30 [0082] Upon arrival at node 1 1 0-B, the search unit 40 sees that the Path String flag is "0", and thus node 1 1 0-B has 
no associated path digit strings. However, the "101" flag is '1'. This indicates that node 1 10-B contains IDI information 
130 which contains the length of the IDI field. (It can be assumed, therefore, that in all probability the semi-octet "T that 
was just processed was the second AFI digit) The search unit 40 retrieves this information, and loads the value into the 
IDI length register. This register will then be decremented by one every time a semi-octet is processed. The next semi- 

35 octet (the first digit of the IDI field) is fetched, and the search progresses. 

[0083] In the example search, the AFI for the search path spedf ies that the IDI fill digits equal the decimal value 0. 
To ignore fill bits, the pointer in node 1 10-B corresponding to a semi-octet value of 0000 points to node 1 10-B. There- 
fore, as long as IDI fill digits are processed and the IDI length register is not decremented to zero, the search does not 
leave node 1 10-B. The search ordinarily leaves the node only after a non^ill (non 0000) digit is parsed. 

40 [0084] The Art of Computer Programming, vol. 3, pp. 490-498: "Sorting and Searching" describes a method of 
buikling a binary trie which avoids one-way branching by including in each node a number of bits to skip over before 
making the next test. 

[0085] In one path the search couU subsequentiy follow. It woukf anive at node 110-D, whose Path Sb^ing flag is 
set. This indicates that node 1 10-D contains a path digit st-ing 125. Before further processing at node 110-D. the search 
45 unit retrieves the path digit string 125 and compares the elements of the string with the sut>sequent semi-octets pre- 
sented to the search unit Only after the entire digit string has been successfully corrpared does the search unit furttier 
process node 110-D. 

[0086] During further processing, the search unit discovers tiiat the result field contains the value xyz, indicating 
that the semi-octets processed so far are a prefix having associated routing information (tiie prefix is associated with 
so the result value xyz). The result value xyz is stored by tiie search unit. If no further result is encountered during tiie 
search, then the value xyz will be returned as the search result, indicating that tiie associated prefix is ttie best one 
stored in the database for ttiis particular search argument. 

[0087] Subsequently, the search unit discovers that the DSP flag is set. This indicates ttiat node 1 10-D has a non- 
nil seventeenth pointer which points into the DSP domain of the database. (In an actual implementation, the DSP flag 
55 bit may be implemented by the use of nil and non-nil DSP pointers rattier ttian witti an explicit bit.) If and only if the IDI 
Lengtti register has been decremented to exactly zero can this seventeenth pointer be accessed. If this is the cas , ttie 
search will continue in the DSP domain of the database. If ttie IDI Length has not been decremented to zero, ttie next 
semi-octet of the search object is used to select one of the first sixteen pointers at node 1 10-D. 
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[0088] In another path the search couW follow, rt would arrive at node 1 1 0-C. At node 1 1 0-C. the Level- 1 flag is set. 
This indicates that the semi-octets of the search argument that have fc>een processed so far exactly match the Area 
Address of the area in which the nsuter resides (i.e. the router that this database is contained within). If. at this point, 
there are exactly fourte n semi-octets of the search argument remaining, then the search argument will be regarded as 
5 an address of an End System within this DECnet Area; accordingly, the eighteenth pointer will be selected at node 1 1 0- 
C and the search will resume in the Level-1 sutitrie. 

[0089] Thus, each node contains several flags and possiile pieces of information. Each node may contain (where 
pointer compression is used) an 18 bit pointer bit mask, 16 pointers to subsequent nodes (or to itself), a DSP pointer, 
a Level-1 pointer, a path compression string and an indication of its length, result information, and 101 length infbrma- 
10 tion. 

Detailed Description 

[0090] A specific implementation of a search unit is described here. The design is driven by the desire to use a min- 
ts imal number of readily available components and to minimize the power consumed. 

[0091] The desire to use a minimal number of readily availat>le components must be stressed. We have descritied 
above minimizing the use of memory through the techniques of path compression and pointer compression. By looidng 
at these techniques from an implementation viewpoint, it may t>e decided that path compression shouM be imple- 
mented but pointer compression should not Although pointer conpression allows a reduction in the amount of memory 
so required, it also requires that the memory be segmented into small pieces that are managed by some memory man- 
agement process. Additional overhead includes extra logic to read and decode the bit mask, multi-lei^el hardware 
adders to accumulate the significant bits of the mask and additional adders to then add this accumulation onto the 
address of the block of compressed vectors. 

[0092] This overhead not only requires the use of extra control logic, but also reduces the potential cycle time of the 
25 machine because the extra k>gic is in the critical path of every cyde. By dispensing with pointer compression, the con- 
trol logic is greatly simplified and the cyde time improved. The addition of more memory does not increase the compo- 
nent count to what it would be with pointer compression in effect. Different implementation requirements, especially 
those with larger databases, may make memory usage the greater consideration. Under these different requirements, 
pointer compression may be used effectively. 
30 [0093] The recognition engine here is designed to fit Into a 'Widget-a' system. This system has a small 3-slot back- 
plane, one slot of which is intended for some form of address recognition engine (ARE). The backplane bus is essen- 
tially that of a host 68020 Motorola microprocessor, but with only 24 rather than 32 address bits. The timing of the bus 
has been slowed down relative to the CPU in order to simplify bus interface logic. It is still possible to guarantee no more 
tfian two wait states (200 ns cycles) when the CPU accesses the ARE if performance requirements mandate this. 
35 [0094] To permit maximum flexibility, the implementation supports eight different root nodes for initiating searches. 
The proper root node may be selected for searching different address formats or performing varfous types of general- 
ized searches. 

Central Components 

40 

[0095] Referring to Fig. 3, there are four tiasic components implementing memory 50 of recognition engine 20, and 
a number of registers and elementary state machines operating concun-ently (collectively known as the search engine 
40). The four central memory components are described t)elow. 

45 DRAM Pointer Memory 

[0096] The core of the recognition engine 20 consists of the node pointer menxiries and the 'cun-ent node + current 
digit -» next node" search unit 40. Refemng to Rg. 8. the pointer memories are implemented by an array of DRAM 
memories 200 supported by a small amount of control logic. A 16-bit state vector (i.e. the node address can-ied by lines 

so 202) and a 4-bit digit (carried by lines 204) requires a DRAM memory 200 organized as 1 M X 1 6 (20 bits X 1 6 bits) and 
allows. 64K states (or nodes) to be implemented. This allows at least 32K addresses to be recognized; in fact, t)y 
asserting that there are 32K nil nodes (with no "next state" chiW nodes) and 32K non-terminal nodes with next states, 
then only half as much physical memory is required, i.e. only one Megabyte (organized as 512K X 16). A DRAM array 
200 of this size requires only eight (256K X 4) chips, or (looking ahead in memory technology) only four (1 M X 4) chips. 

55 [0097] The tMsic machine cycles DRAM array 200 at the highest possible speed, and this speed largely dictates 
the search time for any given address (assuming that ther is time to generate the DRAM address for the next cycle 
after the data for the cun^ent cycle has become valid: this can easily be managed, as the next DRAM address is essen- 
tially a concatenation of the 16-bit pointer data with the 4-bit digit). 
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[0098] A node is thus defined by a 1 6-bit address carried onlines 202. If the most significant bit of the address is 
equal to zero, the node is defined (in this implennentation) to be terminal (with no next-node pointers), but is non-termi- 
nal (i e. has next-node pointers) othenwise. 

5 Fast Static RAM 

[0099] All 64K nodes must provide the extra information C>n addition to the pointers to next nodes) that has been 
alluded to earlier. To supply this information, a fast 64K X 16 static RAM 210 is accessed concurrently with DRAM 
pointer array 200. The static FIAM provides the following information about the cun'ent node: 

- NIL Rag: set if node is NIL (and processing should halt); 

Path String: if a path string stored at the node, a non-zero length is stored in SRAM; 
Result Rag: set if node contains a new result value; 
IDI Rag: set if AFI done (and node contains IDI length); 
75 - Level-1 Rag: Level-1 transition possible (DECnet area recognized at this node). 

[0100] The mapping of this information into physical memory will be more fully discussed later, under the heading 
"Memory Map". 

20 Path Digit String Memory 

[0101] The Path Digit Siring Memory 220 Is accessed concurrently with the DRAM pointer anay 200. String storage 
space is allocated for each of the 64K nodes. The number of digits stored at each node is indicated by the string-length 
field that is stored in the fast static RAM 210. The nmimum number of digits that can be stored at a node is determined 
25 by the address of the node. There are three different maxima: 48 digits, 16 digits, or zero digits. The node address on 
lines 202 determines which maxinum is in effect: the assignment is partly hard-wired and partly logic-programmable 
and is designed to minimize the amount of physical memory. 

[0102] The implementation supports 48 digits of storage at 4K nodes and 16 digits of storage at 52K nodes. 8K 
nodes have no string storage space. The total memory required is 1M digits which is 512K octets or four (1M * 1) chips. 
so Again, with next generation 4Mbit DRAMs, only one chip is required. 

[0103] According to the invention, techniques are used to map the node address (16 bits) and the value of the 
string-digit-counter (6 bits) onto the 1M*4 DRAM address lines (20 bits). In effect, the mapping changes as a function 
of the node address lines. This will also be further detailed in the section entitled "Memory Map* below. 

35 PSP Ppintef Memory 

[01 04] The DSP Pointer Memory 230 is also accessed concurrently with the DRAM pointer array 201 . Only 32K of 
the nodes, known as "transition nodes" (where the node address MSB is equal to one) can access this memory. The 
memory requirement is thus 32K * 1 6. equating to only two chips. The DSP pointer accessed at a node will only be used 
40 when the end of the IDP is recognized; i.e. after an IDI search when the CRL counter is decremented to zera 

Control Logic 

[01 05] Central components of the control logic for the ARE are described below: 
45 [0106] The timing generatv (not shown) is responsible for the correct sequencing of the memory arrays; e.g. RAS 
and CAS control, refresh timing, cyde-to-cyde control, etc. 

[0107] The fetch unit 280, 290 is responsible for supplying semi-octets of the address to the search unit on a 
demand basis. The semi-octets are extracted in onfer from the search address as supplied (in this case) by the host 
CPU to a register file 270. The semi-octets are presented to the search engine and are synchronized to its timing gen- 
50 erator. Interiock is provided, in that if the search engine requires a semi-octet that the host CPU has not yet supplied, 
then the search engine is stalled until the semi-octet is ready 
[01 08] There are three digit-counters that are capable of concurrent operations: 

[0109] The cun^ent remaining length (CRL) counter 240 is loaded with the semi-octet length of the entire search 
argument when it is first presented by the CPU to the recognition engine. The counter is decremented by one whenever 
55 a digit is consumed (processed) by the search engine. If the count reaches zero, it is an indication that the search must 

[0110] The CRLcounter 240 also provides a signal on line 310 to the Cyde Controller (not shown, seedescription 
under Cyde Controller, below) that exactly fourteen digits of the search argument remain, for the purpose of making a 
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Levei-1 transition. 

[0111] The Remalning-IDI-length lOIL counter 250 Is loaded with the length of the IDI that is dictated by the AFI 
value (as specified In the DECNET Phase-V routing specif Icati n). The loading takes place under command of a control 
bit set at a node (the node immediately following the second AFI digit). This counter is also decremented by one when- 
5 ever a digit is consumed by the search engine; if the count reaches zera then the next machine cyde must be a DSP 
transition cycle. The value of the IDIL counter is reset to -64 each time a new search starts. The value -64 indicates 
"length not loaded". 

[01 1 2] Note that the ARE may be used to search arbitrary words in an arbitrary database: it is not necessary to load 
the IDIL counter (if it is not loaded, then a DSP transition cannot and will not be made). 

10 [01 1 3] Strlng-diglt-counter 260 is always reset to zero upon entering any node. The counter is incrwnented at the 
end of e^ery machine cyde that is a string-digit comparison cycle: thus this counter mil remain reset to zero until a node 
is reached at which a constant string is stored. If a string of length N digits Is stored at a node, then the string dgit coun- 
ter will increment from the value zero through IM minus one, as N string digit comparison cydes take place (assuming 
that the comparisons are successful). When the string-digit-counter reaches the value N. no further comparisons take 

IS place: the counter is reset to zero in the next machine cyde. 

[0114] The cycle controller (not shown) might be regarded as the central intelligence of the search engine. It sees 
status information about the current machine cycle and control information from the Node Control Word on bus 330. 
The cycle controller decides how to continue with the present cyde (i.e. whether to compare a string digit select a 
pointer for another cycle, or stop): whether to start another machine cyde, and, if so, whether to use a new digit and/or 

X a new node address. The cycle controller also generates signals to various registers and counters. 

[01 1 5] Status registers 245, 248 are used to save the node address of the most recently traversed prefix (i.e. the 
best prefix so far), and to save useful past history; in particular, whether DSP and/br Level-I transitions have taken 
place, whether the "best preSix so far has ever been loaded, whether the remaining-IDI-length register 250 has been 
loaded. 

25 [01 1 6] Host CPU access transceivers 252, 254. 256, 258, 262 allow the host CPU to access the internal memories 
of the ARE for maintenance purposes. The concurrent operation employed by the search engine is disabled for this pur- 
pose: the memories collectively appear to the host CPU as a single database with what is intended to be a 'programmer 
friendly' structure and address map. In this structure, ail of the data associated with a particular trie node appears within 
a uniformly organized data structure. 

30 

Functional Interfaces to the ARE 

[01 1 7] There are tsasically three functional interfaces to the ARE. These are: 

35 1 . Search interface • packet fbnrarding; 

2. Search interface - maintenance: 

3. Maintenance interface 

[0118] The first of these is straightforward; it may be considered as the primary reason for constructing an ARE. In 
40 tNs mode, forwarding addresses (and/or other addresses, e.g. datalink addresses) that have been extracted from pack- 
ets traversing the network are presented to the ARE. The ARE performs a look-up on the addresses and returns for- 
warding information to a fonivarding engine (in this case, the fonwarding engine is the host CPU). 
[01 1 9] The second and third functions are both concerned with maintenance. The search interface for maintenance 
is very similar to the search interface for packet forwardng. In this case though, addresses presented to the ARE are 
45 not necessarily complete addresses. Typically, address fragments are presented in order to determine what structural 
changes to the database are required to add, delete or modify oitries. A slight modification to the operational behavior 
of the ARE is used for maintenance searches, to cope with the subtle differences of the address semantics. 
[0120] The maintenance interface involves no searching; indeed, the search engine is inhilMted from operating in 
this nxxJe. The address, data and cortrol paths within the ARE are completely restructured so that the internal memo- 
so ries of the ARE are made visible to the host CPU. Each memory component of the ARE is then accessed by decoding 
the CPU address lines, rather than by a search engine. 

[0121] TTie CPU interface to the ARE for the purpose of performing a search is functionally the same, whether the 
search is for the purpose of forwcuding a packet or of inspecting the database. The interface is supported by direct 
memory maw)ed control, status and data registers. The control registers allow the mode of operation of the ARE to be 
55 contrdled, while the sfatus registers alkw the operation of the ARE to be monitored. The data registers provide a 
means of presenting an address t th ARE and of reading a search result 
[01 22] The oontirol registers provide control of the following: 
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1 . Select search mode or maintenance mode; 

2. Select address format mode (influences the interpretation of results); 

3. Inhibit Level-1 transitions (for maintenance searches); 

4. Inhbit lOP to DSP transitions (maintenance searches): 
5 5. Select poll or stall until search result complete; 

6. Control parity protection of ARE memories. 

[0123] The status registers allow the fbllcwing to tie monitored: 

10 1 . State of search engine (reset searching, halted): 
2. State of address input to search engine. 

[01 24] The data registers are broken into eight blocKs. Each block con^esponds to a search initiation at one of eight 
root nodes, but otherwise the blocks are functionally identical. (Some of the eight root nodes may begin datatiases used 

>5 for different types of searches, others may con-espond to the start of, for example, Level-1 branches of an address 
search database.) A register block is 64 bytes in size; the second 32 bytes being an alias of the first 32 bytes O e. using 
the same physical memory). Of the 32 physical bytes in a UocK up to 24 bytes may be used to write an address to the 
ARE that is to be searched. The first byte must be a length indication of the entire length of the address to be supplied. 
(Note that this fbrmat allows direct extraction from ISO 8348/AD2 packets traversing the network). The choice of which 

so alias to write the address into detemiines whether the length byte will be interpreted as the length of the address meas- 
ured in octets or measured in semi-octets. An octet length allows direct extraction from a ISO 8348/AD2: a semi-octet 
length allows artiitrary length prefixes to be searched, which is particularly useful for maintenance searches. The max- 
imum length address that can be searched (in the current implementation) is 23 octets or 46 semi-octets. 
[0125] The search operation will start as soon as the first four octets of the search address are written. An interlock 

25 mechanism prevents the search engine from using octets five through eight until these also have been written by the 
CPU. Similarly, the interlock is provided for the remaining octets in groups of four. Note that four octets may be written 
with one write operation, as the ARE has a 32 bit data interface. Note also that the search address, including the length 
t>yte. must be padded out with arbitrary data if necessary to be a multiple of four octets In total length. 
[0126] The search result will be written by the search engine into the same block that the search address was writ- 

30 ten Into by the CPU. The status of the search engine can be monitored by the CPU (using status registers 248) to deter- 
mine when the search resutt is valid; or, alternatively, an Interlock can be enabled so that the CPU is stalled when 
reading the result until the resuH is valid. 

[0127] The search result is a group of eight bytes. The group is broken up into the following fields: 

35 1. Summary (1 byte); 

2. Result-so-far (2 bytes); 

3. Got-so-far (2 bytes); 

4. Internal counter values (3 bytes). 

40 [0128] The summary byte provides rapid detection of search success or failure. In the case of a search success, in 
packet fbnvarding mode, the result-so-far provides an index into a renrate table (i.e. external to the recognition engine) 
that provides all the necessary forwarding information. In the case of search failure, particularly in maintenance 
searches, the summary indicates the reason for failure. The other fields provide sufficient information to allow the ARE 
database to be easily and rapidly updated by the CPU if desired. Further details on these registers and their interpre- 

45 tation is provided In Appendix B. 

[0129] The CPU can put the search engine into "maintenance'' mode (rather than 'search' mode) by writing to a bit 
in the control register. 

[0130] When in maintenance mode, the search engine does not function and all previously acquired status (except 
for the datatiase) is lost. In this mode, all of the internal memory of the ARE is mapped into some part of the CPU 
50 address space. This part of the address space is not accessitile unless the search engine Is in maintenance mode. 
(This greatly simplifies arbitration for the internal memories and buses, and is consistent with the goal of minimum parts 
count). 

Memory Mao 

55 

[0131] The mapping is organized as 64K fconsecutiv nodes; each node is 256 octets in size. Thus the total vir- 
tual space spanned by the memories Is 16 Mbytes. The memory map of each node is shown in Rg. 9; the physical 
memory parts making up various fragments are shown in parentheses. 
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[0132] In this memory mapping scheme, a resource within a node N that has an offset (as shotm in Rg. 9) of m is 
addressed with the CPU-supplied address: 
N*2*8+m: i.e. the concatenation of N followed by m. 

[0133] Some hard rules are enforced with the mapping scheme of Fig. 9. The rules are: 

5 

1 . All nodes with address less than 8000 hex are terminal nodes. A path digit string may be stored at some terminal 
nodes, but a transition to another node is strictly not allowed at any terminal node. 

2. There are 4K of nodes with provision for up to 48 semi-octets of path digit string storage. Half of these nodes are 
terminal nodes (Nodes 7800 - 7fH hex) and the other half are transition nodes (Nodes fSOO - ffff hex). These nodes 

10 are provided primarily for storing Level-2 entries in the database. 

3. There are 52K of nodes with provision fOr iq? to16 semi-octets of path digit string storage. Half of these nodes 
are terminal nodes (Nodes 1000-77ff hea^ and the other half are transition nodes (Nodes 9000 • f7ff hex). These 
nodes are provided for storing Levet-1 entries, datafink addresses and other entities of total length not exceeding 
16 semi-octets in the database. 

>5 4. There are 8K of nodes with no provision for path digit string storage; this is a consequence of giving up what 
would be storage for 16 semi-octets and allocating the memory to the 4K of nodes with 48 semi-octet storage. 
These nodes can be used, however, for example for those nodes where an IDP count is located within a node (and 
string storage is thus not allowed) or fbr any node where a constant string is not present 
S. There is only one Level-1 pointer. It is accessible at all transition nodes. The pointer should t>e set to point to the 

20 root of the Level-1 branch of the database. 



[01 34] The basic theory of operation for a search according to the invention is described here. It is assured that the 
2S CPU will have built a valid database trie structure before searching starts. 

[0135] Refen-ing to Rg. 8, a register file 270 is used to provide the necessary memory for holding the search 
address that the CPU provides to the ARE. and for holding the result to be returned to the CPU. The register file 270 is 
dual ported, allowring concun-ent access by the ARE and the CPU. This improves performance and obviates the need 
for arbitration control logic. 

30 [01 36] Searching is initiated when ttie CPU writes the search argument into the register file 270. In particular, when 
the CPU writes to the third octet of a longword entry (this will usually be concurrent with writing the whole longword) a 
hardware flag is set that indicates that the associated longword contains valid data. At the same time, the block chosen 
Is remembered for the purpose of starting the search at the correct root node; and the alias used within the block is 
remembered so that the first octet can be property interpreted as an octet or a semi-octet length. 

35 [0137] There are six "ready" flags, one for each of the first six longwords in a block. The flags are cleared in one of 
two ways: either the CPU explicitly clears them by isaiing a maintenance mode command (and by subsequently return- 
ing the mode to search mode); or the CPU overwrites the search argument with a new search argument. Ovenvriting 
the first longword will generate an ARE system reset but will leave the first longword "ready" flag set. The latter method 
allows multiple searches to be performed with little or no control overhead. 

40 [01 38] Control logic 290 around the register file 270 tests the flags and routes the longwords (when they are ready, 
i.e. valid) to a multiplexer 280 built in a couple of programmable logic anays. The multiplexer 280 output feeds semi- 
octets of data to the search engine on bus 204 and generates a "data ready" indication. If "data ready' Is stalled 
(because the CPU cannot fill the file as quickly as the search engine can empty It) then the search engine will also stall. 
The register file control logic 290 looks for a 'next diglT signal on line 282 from the search engine whereupon it will drive 

45 the next semi-octet onto the multiplexer 280 output bus. Synchronization of the 'ready' flags occurs in parallel with the 
serialization of each longword into eight semi-octets; tNs pipelining technique alfows the overhead of synchronization 
tob completely recovered. 

[0139] The first octet in a register file block (i.e. the octet with the lowest address) is routed to the CRL register 240. 
If the length is to be interpreted as an octet-length (rather than a semi-octet length), then its value is doubled so that the 

50 CRL register 240 always holds a semi-octet count The second and subsequent octets are all routed to semi-octet mul- 
tiplexer 280. For each octet entering multiplexer 280. the most significant four bits will be output first followed by the 
least significant four bits. When each semi-octet is valid on the semi-octet bus 204, control logic 290 signals "ready" on 
line 292. If and when the Next Digit signal on line 282 is asserted, the CRL register 240 is decremented by one and a 
semi-octet is consumed. After all of the semi-octets have been output onto the semi-octet data bus (lines 204). the 

55 semi-octet generator logic will still indicate "ready" on line 292 to the search engine even though there are no more 
semi-octets. If 'next digit' is asserted on line 282 to the control logic 290. undefined data will be driven nto the s nv- 
octet data lines 204. However, the data will be qualified with an indication that the 'remaining length is zero' (RLEQO) 
on line 284. 
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[0140] As soon as "data ready' is indicated for ttie first semi-octet on semi-octet data bus 204. 9\e first machine 
cycle starts. The cyde starts by driving a valid digit onto the semi-octet data bus 204 and a valid node address onto the 
node address bus 202. The node address used for the first machine cyde 0-e. the root node address) will be one of the 
eight values 8000 through 8007 (hexadedmal) as determined by which of t}locKs zero through seven was loaded with 
the search address. 

[0141] A machine cycle is equal- (in time) to a DRAM cycle. The master crystal osdilator frequency is selected to 
minimize the cycle time but also guarantee proper DRAM operation under worst case conditions. IHs time is approxi- 
mately 208 nanoseconds per machine cycle. 

[0142] A general case machine cyde will be considered. This geneal case indudes the very first machine cyde. 
Machine cydes will execute back-to-back with no delay, unless there is a delay in obtaining the next digit (indicated by 
'data ready' being negated), caused by an unduly slow load of the register file. Approximately every 1 5 microseconds, 
a machine cyde will be donated to the cause of refreshing the DRAM arrays: no search progress is made during these 
refresh cydes. 

[0143] At the beginning of the machine cyde, the cun-ent node address is driven onto the node address bus and 
the cun-ent search address digit is driven onto the digit data bus. Concurrent access is made to the following memories: 

1 . A next-node pointer is fetched from pointer DRAM array 200; 

2. A DSP pointer is fetched from DSP pointer static RAMs 230; 

3. A next string digit is fetched from path digit string DRAM array 220; 

4. String/IDI length and control bits fetched from fast static RAMs 210. 

[0144] The next-node pointer and the DSP pointer both contend for node data bus 206, as does the Level-1 pointer 
data register 340. Control logic will dedde which contender is enabled onto the bus (see below). 
[01 45] The next string digit is enabled onto bus 208, and is compared with the current search digit on bus 204 (even 
if there is no string digit). The comparison is made t>y comparator 300. The comparison status, indicated by the 'digit 
match' signal on line 302. is made available to the control logic. 

[0146] The control information fetched from the fast static RAMs is available relatively early in the machine cycle, 
after approximately 60ns. By this time, additional status information is available from registers and control drcuits within 
the search engine. This additional status indudes: 

1 . CRL = 14 (line 310): 14 semi-octets remain, Lsvel-I transition possible. 

2. CRL s 0 (line 284): last digH has been used. 

3. IDIL o 0 Oine 312): IDIL has been decremented to. or is being loaded with zera 

4. 'IDI loaded* (line 314): prevents IDI reloading at a fill digit looping node. 

5. 'string exhausted' (line 316): indicates that the value in string digit counter 260 is equal to the 'string length' field 
from the fast static RAM (canled on bus 320) - both values are 0 if there is no string stored. 

[0147] The setting of the above status bits together with the Node Control Infdmiation carried on bus 330 from fast 
static RAM 21 0 determines what will happen the second half of this machine cyde. Basically, the machine cyde will be 
one of four types: 

1 . String digit comparison cycle (string cycle): 

2. Level-1 transition cyde (Level-1 cyde); 

3. IDP to DSP transition cyde (DSP cyde); 

4. Normal next-node pointer cyde (pointer cycle). 

[01 48] Types 2. 3 and 4 may be cdlectively referred to as a 'normal cyde"; there is nothing abnomial about a 'string 
cycle" other than that the search engine does not progress to a new node. 

[0149] The following description of how the cycle type is dedded is given in a sequential form; the search engine 
however makes all these decisions concurrently: 

[0150] If "load IDI length' (a node control bit on bus 330) is asserted and 'IDI loaded" (on line 314) is false, then the 
IDI length will be loaded into this register and "IDI loaded' will be asserted. 'ID! length = 0" (on line 312) will be asserted 
whenever the register contents are zero and "IDI loaded" is asserted. 

[0151] If 'load IDI length' is negated, string comparison may be called for. If 'string exhausted" (on line 316) is 
asserted, this is a "normal cyde": otherwise, this machine cyde will be a path string digit comparison cycle. In the latter 
case, the next-digit and DSP pointers are ignored. The machine waits until the "digit match" flag (on line 302) is valid 
(after about 180ns) and then, if it is negated, a string mismatch is declared and the machine stops. If "digit match" is 
asserted, then 'next digr (on line 282) is requested from the semi-octet generator; string count register 260 is incre- 
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merited; IDI length register 250 and CRL register 240 are deaemented. The machine ends the current cyde and starts 
the next cycle without changing the value on node address bus 202. 

[01 52J String comparison cydes will continue to take place until there is a digit mismatch or until "string exhausted" 
is asserted on line 316, indicating that all digits have successfully been matched, or until."CRL = 0" is asserted on line 

5 284 by CRL counter 240 (considered also to be a mismatch). Whenever 'string exhausted" is asserted on line 316, 
string count register 260 will be reset to zero in the next machine cyde (even if it is already zero). 
[0153] A "string cycle" will not take place if "string exhausted' is asserted during the machine cycle, because 'string 
exhausted' Is only asserted when (1) there is no path digit string at the node: or (2) the string at the cun^ent node has 
already been suocessfully matched. A 'string cyde' will not take place if load IDI length register' is asserted, because 

10 there can be no string if there is an IDI length stored at the node (because only one field in menwry is provided for stor- 
ing length information -when an IDI transition occurs, this field stores the ID! length and thus cannot store string 
lengths). 

[0154] A 'normal cycle" will select one of three potential next node addresses from three sources: the next-node 
pointer, the DSP pointer and the Level- 1 pointer. The selection will be made according to the following rules: 

IS 

(a) If "IDI length » O" is asserted on line 312. and 'IDI loaded' is asserted on line 314. and DSP transitions are glo- 
bally enabled, then a 'DSP cycle' is selected. The value of the DSP pointer accessed from memory 230 becomes 
the address of the next node. Only one 'DSP cyde' is allowed during the search operation, which prevents the 
search engine from being stuck in infinite loops. 
20 (b) If (a) does not apply, if 'CRL - 14' is asserted by counter 240 on line 310, and if the 'Level-I transition- flag is 
set at the current node, then, if Level-1 transitions are globally enabled, the Level-1 pointer accessed from register 
340 wilt be the source of the next node address. In this case, the cyde is a 'Level-1 cyde". Again, only one "Level- 
1 cycle' is allowed per search operation. 

25 [01 55] If either (a) or (b) apply, then the next machine cyde will be run without fetching a "next digit" from the senti- 
octet generator. That is, the same digit will be re-examined once the Level-1 or the DSP transition has been made. If 
neither (a) nor (b) apply, then the 'next node pointer' accessed from pointer DRAM array 200 will be the source of the 
next node address - i.e. this machine cyde will t>e a 'pointer cyde'. In this case, 'next digit' will be asserted on line 282 
to semi-octet generator 290 so that the next digit is available at the beginning of the next machine cycle. CRL register 

30 240 and IDIL register 250 are txrth decremented. 

[0156] For any "normal cyde", the following rules also apply: If the most significant node address bit is equal to 
zero, or if the "NIL node" flag is set. then this machine cyde will be the last for the cunent search argument (i.e.. current 
network address being searched). If the "save current node address" flag is set. then the node address of the current 
cyde will be saved in "result so far" register 295 (only if this is not a nil node), overwriting any former value. 

35 [01571 For a "string cycle', or for a 'pointer cycle' (i.e. any machine cyde that absorts a search digit), the current 
machine cyde will be the last if 'CRL s O" is asserted on line 284. In the case of a "string cycle", a string mismatch is 
declared. In the case of a "pointer cycle", the cun-ent node is declared to represent the greatest progress that can be 
made on the search address. If the signal "CRL = 0" is asserted during a 'DSP cyde" or a "Level-1 cycle", then the 
appropriate transition can still be made since it does not require a valid semi-octet to be present on semi-octet data bus 

40 204. 

[01 58] Enors in the database may be recognized by the machine, causing premature en-or termination. The follow- 
ing errors will be recognized: 

1 . A parity en^or between the cun^ent semi-octet and the current path digit string, as detected by parity check eie- 
45 ment 300; 

2. A parity error in pointer ORAM array 200, as detected by parity check element 302. if the error occurs during a 
"pointer cycle' O-e. when data from this memory is to be used as the next node address); 

3. When string length counter 260 has been inaemented beyond the legal range during a "string cycle'. 

so [01 59] Once the machine has stopped cycling, the 'BUSY' status bit is cleared and the result of the search can be 
read by the CPU. Appendix B details the format and interpretation of the search result. 

Perftamiance oredietton for ARE 

55 Search time 

[0160] The performance of the ARE is relalivety easy to predict. The search time will be roughly equal to the 
number of digits in the search argument multiplied by the machine cyde time. Some overhead must be added to this 
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estimate. 

[0161] Once every 15 microseconds, a machine cyde will be run to refresh DRAMs 200, 220. Only one such 'extra* 
cyde can be expected per search. Additionally, there may be two other machine cycles where a digit is not absortied: 
a 'DSP cycle" and a 'Level-1 cyde'. According to the invention, as discussed above, intrinsic hardware interlocks pre- 
vent more than one of each transition cyde per search argument even with corrupted databases. 
[0162] Thus, if the maximum length of a search argument is forty digits, then it will take at most 44 machine cydes 
to process the argument. (This Indudes one final 'dummy" machine cycle, run with no valid semi-octet on semi-octet 
data bus 204). To this must be added the overhead of loading the input and reading the output registers. This calculation 
is difficuH. especially when the data write and data read cycles are referenced to the instructions that cause them. 
[0163] The CPU register load cydes can all run with at nwst two wait states. The first machine cyde can start as 
soon as the first register file load cyde is detected. This time is of the order 300ns measured from the start of SO of the 
CPU write cyde to the start of the first machine cyda For reading the result. CPU DSACK signals can be asserted (or 
rather, the blocking of them can be released) coinddentally with the end of ttie last machine cyde. The erxl of S5 of the 
CPU read cyde wilt then occur within 150ns. 

[01 64] The CPU will typically transfer six longwords (up to 46 digits, only up to the first 40 being significant to the 
register file. The time measured from the start of the first transfer to the end of the read cyde reading the result will be. 
for a 208ns machine cyde: 

300ns + 44*208ns ->■ 150 ns - 9.6 (is. 

[0165] To give a more accurate figure for the ra/ailable spare CPU time, the transfer time of.six longwords should be 
subtracted from this figure (6*200ns =1.2 jxs). Tliis gives a result of 8.4 ns. 

[0166] The look up time of a LeveM address or IEEE 802.3 datalink address is much less, if the CPU prepends the 
address with an octet of value '1 2', and then transfers the result to register file 270. two longword writes will be required 
(the last octet of which is ignored). By making use of the fact that data in register file 270 is not ovenwritten, one of the 
eight input blocks (and, correspondingly, one of the eight root nodes) may be dedicated ro Level-I look ups - once the 
first octet is written with the v^ue '12', it will stay there. In this way. the Level-1 address can be written directly to the 
file. This will still require two write operations, but at least three CPU bus cycles mil be run by the interface hardware 
because of the misalignment. 

[0167] The root node for Level-1 kx>k up will (presumably) be the same as the initial node in the Level-1 branch of 
the database: no domain transitions can be expected. If a refresh cycle happens during the look up the look up time 
will be 12 digits ••■ refresh -i- dummy: 

300ns + 14*208ns + 150 ns = 3.4 ns. 
[0168] Subtracting the time for (probably 3) write cydes gives an effective look up time of 2.8 m& 



[0169] The datat>ase capacity of the inventive ARE presented here is relatively easy to calculate. Two database 
types are considered to coexist in the ARE : those allowing addresses up to16 semi-octets in total length to be recog- 
nized: and secondly, those allowing addresses up to 40 (potentially 46) semi-octets to be recognized. A total of up to 
eight different datat}ases can coexist, limited by the number of root nodes. 

[0170] The former databases are used for recognizing Level-I addresses, datalink addresses and router-IDs 
amongst others. Given that there are 52K of nodes with up to 16 semi-octet constant string storage at each node, of 
which 26K are terminal, it follows that a combined total of up to 26K entries for these databases can be guaranteed. (In 
the worst case, each transition node will have at most two next-node pointers; thus the tne becomes a binary trie with 
as many transition nodes as final nodes.) 

[01 71 ] Since there are 4K of nodes storing constant strings up to 48 semi-octets in length (of which 2K are termi- 
nal), at first it seems reasonable to suppose that up to 2K entries can be guaranteed for searching these longer 
addresses. However, in the worst case, extra transition nodes may be required - these extra nodes having one exit wily, 
namely the DSP pointer exit. (The DSP transition cannot be stored as part of the constant string.) Under these condi- 
tions, only IK entries can be guaranteed. Note that it is possible to redefine the ARE so the DSP transitions are allowed 
from terminal nodes - then the guarantee increases to 1 .3K entries in the worst case. Note also that it is possible to steal 
nodes from the other datatjases and 'chain" together the nodes with smaller constant string capacities. In fact, the cur- 
rent maximum length IDP defined is only 18 digits in length - hence, even the l&digit capacity nodes can be used for 
holding IDP digits without fear that constant string storage length with exceed node capacity. (If the maximum length 
IDP is 18 digits, then the maximum length IDI will be two less - API digits are not ordinarily stored as part of a constant 
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string.) 

[0172] F=br packet forwarding purposes, note that the Level-2databas is branched to the Level-1 dataiiase through 
the Level- 1 pointer. Thus it may be stated that a database can be built that guarantees recognition of up to 26K 
acklresses that are in-area (and will be routed by Levei-I routing) and up to 1 K addresses that are out-ol-area (and will 
be routed according to longest prefix). 

[0173] Other embodiments are within the scope of the claims that follow the appendices. 
[01 74] Ftor example, the invention may be used for searches of any large database that contains strings of (possibly 
widely) varying lengtti. Such databases are common: examples include directory look ups in a telephone context, on- 
line dictionaries, spelling checkers, looking up social security numbers, etc. In any such context where the directory 
look up is performed using a search trie, the invention would be applicable. Although the word 'node' has been used 
here to desaibe an element in a hierarchical tree sb-ucttre. the invention applies to any tree structure search, wthether 
or not the word 'node' is used to descrbe the tree structure. 
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Appendix Ai APR Memory Map 

The ARE is mapped into a 16 MB space in the Widget memory 
nap, between addresses 0800 0000 and OSff ffff . 

The 16 MB ARE space is divided into 64K "nodes", where each 
node is 256 octets in size. Thus, an ARE address (consisting of 
eight hexadecimal digits) may be broken up into three fields, as 
shown: 

08|nnnn|aa 

Here the first two digits, -08", select the ARE; the next four 
digits, "nnnn" select one of 64K nodes; and the final two digits, 
"aa" select a subspace within a node. 

The allowable subspaces that can be accessed at each node are 
defined in Fig. 9. 
Description of resoureas 
Node Control Wai-d 

The node control word is a byte-addressable l6-bit word 
supported at" every node. Not all bits are interpreted by the 
search engine, but all bits are implemented to facilitate ARE 
maintenance (e.g. by chaining free nodes). The format and 
explanation of this word is given below : 

|NIL|SR|lC|Ll|dH|dl0|d9|d8|d7|d6|<— LEKGTH — >| 
NIL (Nil Node) : If this bit is set, then this node is 
interpreted by the search engine as a nil node. At a nil node, 
the setting of any other control bits is irrelevant. Immediately 
upon entering a Nil Node, the search engine will stop, indicating 
"Nil Node" as the reason for stopping. Any string stored at a 
Nil Node is ignored. The address of the Nil Node is not reported 
in any result returned by the search engine. 

NOTE: All nodes whose address MSB is equal to zero (i.e. 
nodes 0000 through 7fff) are Terminal Nodes. During a search 
operation, if a Terminal Node is encountered the search engine 
will stop rather than to make a transition from the node. 

SR (Save Result) : During a search operation, all nodes 
encountered that have this bit set will be saved in Result So Far 
register 245. This single word register 245 will always contain 
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the address of the node most recently encountered with the SR bit 
set. The SR bit will not be examined by the search engine until 
any constant string present has been successfully matched against 
the search key; string mismatch (which includes premature 
exhaustion of the search key digits) will result in the node NOT 
being saved, despite the condition of the SR bit. The node 
address will not be saved if the node is MIL. 

IC (IDP Count present) : If this bit is set, then the value 
of the "length" field will be interpreted by the search engine as 
the length of an IDP. This bit should be set at the first node 
•following the second digit of the ATI. The Idngth field should 
be set to the appropriate IDP length that is indicated by the 
relevant AFI. Only one such value will be loaded during a search 
operation. The IC bit will be ignored if the node is NIL. 

NOTE: A constant string cannot be stored (and will be 
ignored) at any node with the IC bit set. 

LI (Levei-l transition possible): This bit will be examined 
by the search engine only when it has exactly fourteen digits of 
the search key remaining and it has successfully matched any 
constant string present against the search address. If this is 
the case, and the LI bit is set, then the search engine will use 
the value of the Level-1 pointer as the address of the next node. 
dll.dlO, , . . ,d6: These bits are not used by the search engine. 
LENGTH (5 bits): This field is interpreted in one of two 
ways: 

1. If IC is set, and NIL is not set, then this field is 
interpreted as the length of an IDP (as explained above) . 

2. If IC is not set, and NIL is not set, then this field is 
interpreted as the length of a constant string stored at the 
node. The constant string will be matched against the digits of 
the search key until there is a mismatch or until either source 
is exhausted. 

String storage snaegs 

String storage space is provided to store constant strings of 
variable length at a node. The string is stored as a succession 
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of digits, one digit per byte. (The four most significant bits 
of each byte are ign red on a write and always read as zero.) 
The length of the string (number of digits stored) is indicated 
in the Node Control Word. The naximum number of digits that may 
be stored is a function of the node address, as shorn below: 
Transition Nodes: 

fSOO <> Node Address <> ffff - maximum 48 digits. 
9000 <- Node Address <« f7ff - maximum 16 digits. 
8000 <> Node Address <- 8fff - no storage allowed. 

Terminal Nodes; 

7800 <= Node Address <= 7fff - maximum 48 digits. 

1000 <=» Node Address <= 77ff - naximum 16 digits. 

0000 <= Node Address <■ Offf - no storage allowed. 

("X <= y" means x is less than or equal to y) 

When the search engine encounters a node at which a path 
digit string is stored, the string will be compared against the 
remaining search address digits until either source of digits is 
exhausted or until there is a mismatch. 
Next Node Pointers 

Eighteen "next node" pointers are provided for all 32X 
Transition Nodes. (Pointer compression has not been 
implemented.) The first 32K nodes are "Terminal Nodes" and no 
transitions can be made from them. 

A next-node pointer will be selected by the search engine if 
a string match has been successfully completed (where relevant) , 
if the node is not a terminal or NIL node, and if there are 
search address digits remaining. The selection is made according 
to the following rules (refer to Theory of Operation, above) : 

1. If an IDI length has been loaded into the IDIL register 
250 and that length has been decremented to exactly zero (or, if 
not, and the current node is loading a total IDP length of 2 into 
IDIL) - i.e. if the search engine believes that the next search 
key digit to be processed is the first digit of the OSP - then 
the DSP Pointer will be selected. The OSP Pointer can be 
selected at most once during a search operation. Selection of 
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the DSP Pointer does not consume the current search key digit - 
it will be used in the next oachine cycle. 

2. Otherwise, if there are exactly fourteen search key digits 
remaining and the LI bit is set in the Node Control Word at this 
node, then the Level-1 Pointer will be selected. The Level-1 
Pointer can be selected at most once during a search operation. 
Selection of the Level-1 Transition Pointer does not consume the 
current search key digit - it will be used in the next machine 
cycle. 

NOTE : If both (1) and (2) apply, the conditions for (2) will 
"be re-examined after (1) has been effected. 

3. Otherwise, the next digit of the search key will be used 
to select one of the first sixteen next-node pointers. For 
example, if the digit is zero, then the pointer at address 40 
will be used. (Pointer address offset is 40, refer to Fig. 9) . 
Search Address Storage Area 

32 bytes of storage space are provided at each of nodes 8000 
through 8007. Of these 32 bytes, the first 24 bytes are used for 
loading a new search address; the result of the search is written 
by the search engine into the last eight bytes. Up to eight 
search contexts are supported - but note that a new search should 
not be initiated until the current one is complete (because this 
will abort the current search) . When a search is initiated, the 
node used to load the search address will be the root node used 
by the search engine. The format of the search address must be 
as follows: 

First byte (address 80/aO) : length of search address. 
Subsequent bytes : search address, length as defined in first 
byte. 

The length information written to the first byte is 
interpreted in one of two ways : If the byte is written to 
address 80, it is interpreted as an octet length. I.e. the number 
of digits provided is twice the value of the length byte; if the 
length byte is written to address aO, it is interpreted as a 
semi-octet length, explicitly indicating the number of digits in 
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the search key- Address space aO-bf is otherwise an alias of 
address space 80-9f . (NOTE: th current ARE implementation 
regards a semi-octet length in excess f 40 (decimal) or equal to 
zero as an error) . 

More digits can be written to the search key area than are 
implied by the length octet; at most 32 octets in total should be 
written or else the thirty-third octet will either reinitialize 
the search or cause a bus error. If more than 24 octets are 
supplied, the excess octets will be overwritten with the search 
result at the time that it becomes available. If fewer octets 
are supplied than the length field indicates, then the Search 
engine will wait indefinitely for them to be supplied unless it 
terminates the search prematurely. 
Search Result Informal^ 

The result of a search is available at addresses 98 - 9f 
(equally b8 - bf) at the same node in which the search key was 
loaded (i.e. one of nodes 8000 through 8007). The format of the 
result is as follows: 

Ottset. Inter«fetati»n 

98 or b8 Summary status byte. 

99 or b9 Remaining semi-octet length of search address at 

termination of search, [byte] 

9a-9b or Most recent node encountered with SR bit set 
. ba-bb [unsigned word] . 

9c or be Remaining semi-octet length of IDI at termination 
of search [signed byte] . 

9d or bd Value of string digit counter at termination of 
search, [byte] 

9e-9f or Kode most recently encountered during search 
be-bf (except if this is a KIL node) . [unsigned word] 

Refer to appendix B for detailed information on interpreting 
the result information. 
gontrQl and gtatWg Registers 

Parity Control and status r eaistgr o Address OBOOOOcO 
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foraat: I PES I PEP I EPS | EPP I 0 I o | o I o I 

reset state: |o|o|o|o|o|o|o|o| 
Status register 0 provid s parity control and status for CPU 
accesses to the database neaories that are implemented with 
DRAMS. 

PES: Set whenever parity error is detected on CPO read of 
string storage aeoory. Reset by CPU overwriting with a one. 

PEP: Set whenever parity error is detected on CPU read of 
next-node pointers. Reset by CPU overwriting with a one. 

EPS: Read/write by CPU. When zero, use odd parity (generate 
^nd check) on string storage memory; when set to one, use even 
parity . 

EPP: Read/write by CPU. When zero, use odd parity (generate 
and check) on next-node pointer memory; when set to one, use even 
parity. 

Control Register 1 - Address OSOOOOcl 

format: I SM I NSAP I ILl j IDSP | SDK I d2 I dl I dO I 

reset state; |o| o|o| o|o|o|o|o{ 

There is one control register for the entire ARE. The bits 
are defined below: 

SM (Search Mode): When SM = 0 {= "maintenance mode"). Search 
Mode is inhibited and search result information is invalid; the 
search engine is reset. This mode must be selected before the 
CPU can access any resource below address It. When SM = 1, 
Search Mode is enabled. This mode must be selected before a 
search key is loaded, and must be maintained until the result has 
been read. In this mode, all resources below address 7f are 
inaccessible. 

NSAP (NSAP mode) : This bit influences only the setting of the 
success/ failure bit in the result summary octet at the end of a 
search. (refer to Appendix B) . 

ILl (Inhibit Level-i transitions): If this bit is set, then 
during any search a Level-l transition will not be made. 

IDSP (Inhibit DSP transition): If this bit is set, then 
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during any search, a DSP transition will not be made. 

SDK (Stall DSACK) : If this bit is set, then reading the search 
result inf rmation (defined under Search Result Inf mation 
above) will result in a possibility extended bus cycle that will 
be stalled until the result is available; this eliminates the 
need to check if the result is available. This is applicable to 
every context, but only in Search Mode. 
d2,dl,dO - not used by the ARE. 

Search Status Register 2 - address OSOOOOcZ (read only) 
format: I BCNT I RT2 I RTl I RTO I 0 I 0 I 0 I 0 I 

reset state: | u|u|u|u|o|o|o|o| 

The four status bits available from this register are 
provided primarily for diagnostic purposes. The bits are valid 
only when the search engine is either HALTed or BUSY (as defined 
in status register 3). 

BCNT: This bit is set when the search engine is assuming that 
the first byte of the search address is a octet-wise length 
count; if BCNT is cleared, then a semi -octet (digit) count is 
assumed. 

RT2 . . 0 : These three bits are equal to the three least 
significant bits of the root address that the search engine is 
using for the purpose of reading the search argument. 

Search Status R egister 3 - address 080000c3 (read only) 
format: I BUSY I halt | f5 | f4 I f3 | f2 | fi | fo | 

reset state: |o |o lo|o|o|o|o|o| 

This status register provides search engine status for 
application and diagnostic use. 

BUSY: This bit is set when the search engine is processing 
nodes or is loading the result. 

HALT: This. bit is set when the search engine has stopped 
processing nodes. 

The above two bits should be decoded as a pair. 
BUSY HALT Interpretation 
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0 o Search engine is reset. 

1 O Search engine is searching. 

1 1 Result is being loaded (short -transitional state) . 

0 1 Result is available for reading. 

f5..f0: These bits are set by %n:iting components of the 
search key into the register file, and are used by the search 
engine as an indication that valid data is present. 

All bits of register 3 are reset whenever the machine is in 
maintenance node (bit 7 of control register 1 is zero) . Bits f 5 
through fl are reset whenever the CPU writes to the last byte of 
the first longword of the search key; at the sane tiae> bit fO is 
set (if search mode is enabled). Bit fl is set, in search node, 
when the CPU writes to the last byte of the second longword of 
the search key; and bits f2 through f5 are similarly set on 
writing the last byte of the third through sixth longwords. 

ARE Status Fi?qist;9r 4 - address 080000c4 (read only) 

format: I s7 I s6 I s5 I s4 I s3 I s2 I si I sO I 

reset state: - - - - - - - - 



This status register allows the CPU to read the positions of 
eight switches in the on-board switchpack. 
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Appendix B; Decoding the Search R esult from the ARE 

The result f a search is available at addresses 98 - 9f 
(equally bS - bf ) at the same node in which the search key was 
loaded; i.e., one of nodes 8000 through 8007. If the control bit 
"SDK" is set (bit 3 in control register 1) , then the result may 
be read at any time after loading the search key - the CPU will 
be stalled by hardware until valid data can be returned, if the 
"SDK" bit is not set, then the CPU must poll status register 3 to 
determine when search result status is valid. 

The format of the result is as follows; minimum delay will be 
incurred if the result is read as two longwords. 

address | field contents 
(* = 0..7) 



08800*98 
08800*9C 



<— summary->|<-Rem. Len.->|<-Result So Far->| 
|<-Rem. IDI->|<-str. Cnt.->|< Got So Far— > 



ZisM Size 

Svunmary byte 

Rem. Len. byte 

Result So Far word 



Rem. IDI. byte 



Got So Far word 



Suiptary gtflt^g PYt:e, 



interpm^t^pn 

Summary status byte (see below) . 

Number of remaining search key digits not 

processed by the search engine 
Most recent node in TRIE traversed by 

search engine that had "save result" 
bit set. 

Mtimber of remaining IDI digits in search 
key not processed by search engine 
(value may be negative) . 
String counter value - provides 

information in cases of string mismatch 
search termination. 
Node being processed by search engine at 
time of termination (except if this is 
a NIL Node) 



The summary status byte is provided so that the host CPU t 
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rapidly determine whether a search was successful or not; and if 
not, the reason for the search failure. 

The encoding depends upon whether the database is deemed by 
the search engine to be corrupted or not; and is further 
influenced by the "NSAP node" bit in Control Register 1. 

The two cases are dealt with separately below: 
Case 1 ; database corrupted. 

For a corrupted database, the encoding of the output will be: 

I 1 I 1 I 0 I ISC I PEP I PES ) LIND | LlOE | 

where 

ISC » Illegal String Count reached (the CPU indicated^ that 
there were more string digits present than the maxinua allowed at 
a node) 

PEP = Parity Error on Pointer memory 
PES « Parity Error on String memory 

LlND = Level-l transition made but no DSP transition made 
(NSAP mode only) 

LlDE = Level-l transition made and source digits exhausted 
(NSAP mode only) 

The setting of bits ISC, PEP and PES is mutually exclusive. 

If ISC is set, then the search engine attempted to run a 
string comparison cycle with a digit pointer that exceeded the 
maximum string length allowed at a node. The "Got So Far" field 
contains the node at which the string is stored; the "String 
Count" field contains the string digit pointer which is too 
large. 

If PEP is set, then a transition was made from a node through 
one of the sixteen "next node" pointers. The address indicated 
for the next node contained a parity error. The "Got So Far" 
field contains the node at which the corrupted "next node" 
pointer is stored. The "Remaining Length" field indicates how 
many digits of the search key remain to be processed by the 
search engine; the first of these remaining digits is the one 
immediately following the digit that was used to select the 
corrupted "next node" pointer. 
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If PES is set, then a string comparison cycle was run but the 
St red string digit contained a parity error. The "Got So Far" 
field contains the node at which the corrupted string is stored. 
The "String Count" field points to the digit after the one at 
which a parity error was detected. 

The setting of the LIND and LIDE bits is independent of the 
ISC, PEP and PES bits. At the termination of the search, for 
whatever reason, the LIND bit will be set if NSAP node is 
selected, and a Level-1 transition has been made but not a DSP 
transition. The LIDE bit will be similarly set if a Level-l 
transition has been made and there are no remaining seaz'ch key 
digits - the search engine expects that there will normally be at 
least two remaining digits at the termination of an NSAP search 
if a Level-1 transition has been made (these two digits are the 
SEL field) . 

case 2 ; database not earyuptPrt. 

If the database is not considered to be corrupted, then the 
encoding of the bits in the Summary Status Byte will be: 

I F I 0 I RL I NIL I SM I DE I LI I DSP I 

where 

F = search Failure (O = success) . This bit is influenced by 
the NSAP Mode bit. 

RL = "Result So Far" field is valid. 

NIL = NIL Node reached 

SH = String Mismatch occurred 

DE = Source Digits Exhausted 

LI = Level-1 transition made 

DSP = DSP transition made 

The F bit is 0 when the search is successful. A successful 
search is defined as one in which the database is not corrupted, 
the supplied length is in the range 1 to 40 semi-octets, and, in 
addition, in NSAP mode either a) a Level-1 transition has not 
been made and the Result-So-Far register has been loaded at least 
once; then the encoding of the summary status byte will be: 00 IX 
XXOX — or b) a Level-1 transition has been made, there has been 
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no string-mismatch, a terminal node has been reached at which the 
SR bit is set, and (implicitly) a DSP transition has been made 
and not all of the search key digits have been used. The 
encoding of the summary status byte in this case will be: 0010 
0011. 

In non-NSAP mode a successful search is where the database is 
not corrupted, the search address length is in the prescribed 
length, all of the search address digits have been used, there 
has been no string mismatch (and the search address digits did 
not become exhausted during a string match) , and the final node 
reached (which is not a NIL Node) has the SR bit set. The final 
node may or may not be a Terminal Node — if it is a Terminal 
Node, then the "DE" bit will be zero (even though the digits are 
exhausted) . The encoding of the Summary Status Byte will be: 
0010 OXXX 

The RL bit is set after the "Result So Far" field is loaded 
with a node address during a search. This bit indicates that the 
"Result So Par" field contains valid data; if RL equals zero, 
then the value of the "Result So Far" field is undefined (in 
preferred embodiments, it will contain the root node). 

The setting of the LI bit depends upon whether the control 
bit "ILl" in control register 1 is set or not. If it is not set, 
then Level-1 transitions are enabled, and the LI status bit will 
be set only if a Level-l transition was made during the search. 
Otherwise, if the "ILl" bit is set, then Level-1 transitions are 
inhibited; in this case, the LI status will be set if, at any 
time during the search, a transition was made from a node (not a 
DSP transition) at which the node control bit "Enable Level-1 
Transition" was set. 

The DSP bit will be set if and only if a DSP transition was 
made during the search. If the control bit "IDSP" is set in 
control register 1, then DSP transitions are inhibited and the 
DSP (transition made) status bit will always be zero. 

The NIL, SN, DE bits encode the reason why the search engine 
stopped in the case where the database is not corrupted. Each 
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case is dealt with in detail bel w; note that, as explained 
above, the "Result So Far" field is only valid if the RL bit is 
set. 

a) NIL, SM, DE = 000: This encoding means that a terminal 
node was reached by the search engine. The "Got So Far" field 
contains the terminal node. The "Result So Far" field contains 
the node most recently traversed that had the "SRSF" node control 
bit set; if "SRSF" is set at the tominal node, then "Result So 
Far" will be equal to "Cot So Far". The "Remaining Length" field 
indicates how many search key digits were not used by the search 
engine; the value may be zero, indicating that the very last 
digit was responsible for taking the search engine to the 
terminal node. 

b) NIL, SM, DE = 001: This encoding means that the search 
engine stopped because it ran out of search key digits; it would 
otherwise have made a transition from the node at which it came 
to rest. The "Got So Far" field contains the node reached by the 
search engine when the digits ran out. This node is neither NIL 
nor a terminal node. If there is a string stored at the node, 
then it will have been successfully matched. If the "SR" bit is 
set at this node, then the "Result So Far" will be equal to "Got 
So Far"; otherwise, it will contain the node most recently 
traversed that had the "SR" bit set. Note that a Level-1 or a 
DSP transition can be made from node even when the search address 
digits are exhausted; thus, neither a DSP nor a Level-l 
transition can be made from the final node reached as indicated 
by "Got So Far". 

c) NIL, SM, DE a 010: This encoding means that the search 
engine stopped because it detected a mismatch between a constant 
string digit and the next digit of the search key. The "Got So 
Far" field contains the node at which the constant string is 
stored. The "String Count" field points to the string digit 
after the one which mismatched. The "Remaining Length" field 
indicates how many search key digits have not been processed by 
the search engine; the first of these remaining digits 
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immediately follows the digit that was mismatched against the 
constant string digit. 

The "Result So Far" field contains the node most recently 
visited (before the one at which the search terminated) that had 
the "SRSF" bit set. 

d) NIL, SK, DE - Oil: This encoding means that the search 
engine stopped because it could not perform the string digit 
comparison cycle that it was tasked with. This occurred because 
either: (i) the digits in the search argument became exhausted, 
or (ii) specifically, the IDP digits of the search argument 
Isecame exhausted while comparing against a path digit ^ring 
storing only IDP digits. The "Got So Far" field contains the 
node at which the string is stored. The "String Count" field 
points to the string digit that would have been compared against 
(i) the next search key digit, or (ii) the next IDP digit, had 
there been one. 

The "Result So Far" field contains the node most recently 
visited (before the one at which the search terminated) that had 
the "SR" bit set. 

e) NIL, SM, DE ■> 100: This encoding means that the search 
engine reached a NIL node. The "Got So Far" field contains the 
node most recently visited before the NIL Node; similarly, the 
"Result So Far" field contains the node most recently visited 
(before the NIL Node) at which the "SR" bit was set. 

The transition to the NIL node was made by using a search key 
digit to leave the node indicated in the "Got So Far" Field. 
This digit is the last that was used by the search engine; it is 
not included in the "Remaining Length" or "Remaining IDI length" 
counts . 

f) NIL, SM, DE " 110: This encoding also means that the 
search engine reached a NIL node. In this case, the transition 
was made by using either a DSP or the Level-l pointer to leave 
the node indicated in the "Got So Far" field. In the usual case, 
the Level-l pointer will not point to a NIL Node and so it can be 
assumed that a DSP transition was made to the NIL Node. 
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Note that, even though the DSP transition was to a NIL Node, 
the status bit "DSP Transition Made" will b set; the value f 
the "Remaining IDI length" will be zero. The value of the 
"Remaining length" field will be equal to the entire DSP length 
of the supplied search key. 

g) NIL, SM, DE =» 111: This encoding means that an illegal 
search key length was supplied to the search engine; i.e., zero 
or more than forty digits. In this case, the search engine makes 
no progress at all on the seaurch key. 
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Overview of maintenance support required 

Maintenance engine 18 inserts a new entry into the database by 
first presenting it as a search argument to search unit 40. Search 
unit 40 attempts to locate the search argument in the database; if it 
fails, it will, at least, report the longest prefix of the search 
argument that exists in the database. (This longest prefix will not 
necessarily be associated with "result" in the database) . 

At most two new nodes will be inserted in the database; one new 
node will be added to differentiate the new entry from all existing 
entries - this node will be inserted at the point where the search 
unit located the longest prefix of the new entry; one additional new 
node will be added, and will contain a path compression string equal 
to all of the remaining digits of the new entry beyond the prefix 
match with the database. 

The differentiation node only needs to be added where the new 
entry was being compared against a path compression string the 
database, and a mismatch occured during one of the comparisons. In 
this case, the path compression string must be broken in two at the 
S>oint of mismatch. The part which compared successfully against the 
new entry is stored in a new node (the differentiation node) , which 
will have two non-nil pointers. One pointer points to the node which 
originally held the path compression string (the original path 
compression string is replaced by the part of it which mismatched 
against the new entry) . The second pointer points to the new node 
which holds the path compression string (if any) of the remaining 
digits of the new entry. 

"^vt inverse process « "-S*^ eUleWan. 
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Appendix C; Menorv Savings of Pointer a nd Path Compress inn 



The analysis presented here shows roughly how much physical 
memory can be saved by applying the techniques of path 
compression and pointer compression to a TRIE based database. 
For the analysis, it will be assumed that there are up to sixteen 
pointers at a node, corresponding to four-bit characters (semi- 
octets) of the search argument processed at each node. For 
different sizes of characters, the analysis should be performed 
with a parameter substituted for the fixed size of sixteen 
pointers assumed here. Additionally, the idiosyncrasies of ISO 
and DECnet Phase-V addresses are not considered. 

The analysis will consider the memory requirements for 
storing a worst -case database with a given number of entries. 
Firstly, path compression will be considered, followed by the 
application of pointer compression in addition to path 
compression. 

Consider a TRIE based database that is required to store up 
to N entries. Let the entries be of variable length up to a 
maximum of L semi -octets. Each node in the TRIE will contain 
sixteen pointers. In the worst case, a maximal number of nodes 
is required to hold these N entries. 

The worst case will occur when all distinctions between the 
entries are made at nodes as close to the root as possible. In 
the way, every entry will have associated with it a large chain 
of nodes each using only one non-nil pointer pointing to the next 
link in the chain. The depth of TRIE required to specify all N 
entries will be log,jN. The total number of nodes used up to 
this level will be {N-l)/15. 

Nodes that are used within the long chains make up the bulk 
of the total number of nodes required. There will be N chains. 
If every entry in the database is L semi-octets long, then every 
chain will be of length {L-log„M) , the end of each chain 
pointing to a terminal node for which no storage requirement will 
be assumed. 
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Thus, the total number of nodes required will be; 
(N-l)/15 + (L-log,4N)*N 

each f size 16 pointers. To a good first approximation, 
this can be simplified to 

L*N nodes. ^2) 

Now consider that path compression is implemented. Long 
chains of nodes will now be collapsed into terminal nodes or into 
nodes with at least two pointers. The greatest amount of memory 
will be required when there are as many (non- terminal) nodes as 
possible. This is the case when every non-terminal node has 
•exactly two pointers (for example, a binary ttie) . The total 
number of non-terminal nodes will be K-1. Terminal nodes will 
require no storage for pointer; thus the memory requirement for 
pointers will be (with sixteen pointers per node) 

(N-1)*16 pointers 

There is also some memory requirement for the path 
compression strings. The distribution of the strings amongst the 
nodes can vary widely; but if there are M entries in the database 
each of length L semi-octets, the total storage requirement 
cannot exceed {N*L) semi-octets, if a pointer is equated to four 
semi-octets, then adding the requirement of equation (3) gives a 
total memory requirement of 

(N-l)*64 + (N*L) - N*(64+L) semi-octets (4) 
to a good approximation. When compared with the storage 
requirement for the same sized database without path compression, 
equation (2), the ratio of (2) to (4) is 

(64 * L)/(64 + L) (5j 
which is bounded by the limits of i (for small L) and 64 (for 
large L) . in the case of ISO 8348/AD2 network addresses, for 
which L-40, the reduction factor is 24.6, representing a very 
significant savings in physical memory. 

Note that an implementation may want to allocate enough 
memory for a maximxm length path compression string at every 
node, in order to make memory management easier. In this case, 
approximately twice as much memory is required to store the path 
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coopression strings, and equation (5) is modified t 

(32 * L) / (32 + L) (6) 

Which has an upper bound of 32, and the reduction factor for the 
case of ISO addresses is 17.8, still very significant. 

Next consider adding pointer compression to a path compressed 
TRIE. The worst case storage requirement for a path compressed 
TRIE without pointer compression is given by equation (4). When 
pointer compression is introduced, the worst case occurs when as 
many actual (i.e. non-nil) pointers as possible are used, since 
each non-nil pointer points to a node, this occurs when as many 
nodes as possible are used. Thus the worst case occurs^' for the 
same binary trie structure used in the non pointer compressed 
case above. The size of each non-terminal node will be two 
pointers plus a bit mask. The bit mask is sixteen bits wide, 
essentially the same size as a pointer. Thus the storage 
requirement for the path compressed and pointer compressed TRIE 
is, in the worst case, 

(N-l) * 3 pointers + (N * L) semi-octets = (7) 

K* (12 + L) semi-octets (8) 
to a good approximation. By comparison with the storage 
requirement for path compression alone, equation (4) , there is a 
further reduction in memory requirement by a factor of 

(64 + L) / (12 + L) (9) 
having a value between 1 and 5. For the case of ISO addresses, 
where L » 40, the reduction factor is 2. 

The overall reduction factor for a path compressed and 
pointer compressed database, compared to a database with no 
compression, is the product of equations (5) and (9) . This is 

(64 + L) / (12 + L) (10) 
which is bounded by the limits of 5 for small L and 64 for large 
L. For the case of ISO addresses, the overall reduction factor 
is 49. 



1 . A method of conducting a search of reduced length through a stored dataljase containing a tree structure of nodes 
(110) and pointers (105). nodes with pointers being parent nodes and targets of pointers being child nodes, point- 
55 ers to child nodes corresponding to each of a number of possible values that can occur as fragments of a search 
argument, successive fragments of said search argument defining a search path following said pointers through 
said nodes, at least some of said nodes (110-11) identifying a result value (120), 

characterized by elinranating a removable node (1 10-7) that occurs between an immediately previous (1 10-2) and 
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an immediately following (110-1 1) node in said search path, the removable node having only one child node and 
either no result value or the same result value as its parent, the method comprising the machine-implemented 

5 storing information (125) describing conditions under wMch, had said eliminated node been present the 

search would have proceeded to said following node, 
following said search path through said nodes. 

when said search reaches said previous node (110-2). determining whether said search argument meets the 
conditions of said stored information (12^, and 
10 causing said search to effectively progress from said previous node (1 10-2) directly to said following node 

(110-11) if the comparison indicates that, had said diminated node (110-7) been present, said search would 
have proceeded to said folkwing node. 

2. The method of daim 1 . wherein 

IS 

■ the result value that would have been provided by said eliminated node is equal to the result value that is pro- 
vKied by said following node. 

3. The method of daim 1 . wherein said information (125) is stored in said following node (1 10-1 1). 

20 

4. The method of claim 3. wherein 

said search argument comprises a series of search segments, each segment of said argument con-esponding 
to a node (1 10) along said search path, 
ss at least one value of a segment con'esponding to a given node is related to a pointer (105) leadng from said 

given node to another node, and some other values of said segment are related to a terminating mode of said 
search. 

5. The method of claim 4, wherein 

30 

said stored information (125) is a given value of a search segment con'esponding to said eliminated node, 
sakl given value being a value that would have been related to a pointer leading from sad eliminated node 
(1 10-7) to said following node (1 10-1 1). 

35 6. The method of daim 5, further comprising 

associating one or more sets of indicators (1 15) relatively with one or more nodes (110), each indicator in a 
given set associated with a given node indicating a value of a segment of said search argument con-esponding 
to said given node that is related to said terminating mode, and wherein 
40 following said search path comprises processing successive search segments of said search argument, 

processing of a given search segment having a given value including 

retrieving the set of indicators associated wnth a node along said search path that is assodated with said 
given search segment arxt determining whether an indicator in said set indicates that said given value is 
45 related to said terminating mode, and 

proceeding to said tenninating mode if an indicator indicates that said given value relates to said terminat- 
ing mode, othenwise 

retrieving a pointer leading from said given node that is related to said given value, and following said 
pointer to a subsequent node along said search path. 

so 

7. The method of daim e.wherein said terminating mode of said search comprises terminating said search. 

8. The method of daim 7 wherein 

55 said search argument comprises a system address in a network, and 

said result values comprise routing information for transmitting packets through said network to a device des- 
ignated by said address. 
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9. A method for associating routing information with a packet address in a distritxjted digital data processing networK 
comprising the machine-implemented steps of: 

storing a database tree structure of non-nil nodes (1 10) and poimers (105) leading respectively from one of 
said nodes to another of said nodes, nodes with pointers being parent nodes and targets of pointers being child 
nodes, at least some of said nodes identifying a result value, 

a path through said nodes following said pdntere being identified by values of corresponding respective seg- 
ments of said address, each value of a segment identifying a different pointer to a subsequent node along one 
said path, 

storing nil nodes (110-10,1 10-17) ending said paths, 

storing, as said result values, routing information in one or more of said nodes, the routing infonnation stored 
in a node being associated with any packet address ttiat identifies a path passing through said node, and char- 
acterized by: 

replacing a given node (110-7) having only one pointer leading from said given node (110-2) to a non-nil sub- 
sequent node (110-11) and having either no result value or the same result value as Its parent node, with a 
path string (1 25). said path string storing the value of a segment conresponding to the given node that identifies 
the pointer to said one non-nil subsequent node. 

10. The method of claim 9. further comprising 

replacing other nodes having only one pointer leading respectively from said other nodes to a non-nil subse- 
quent node, until no such nodes remain in said datat)ase. 

11. Apparatus (20) for conducting a search of reduced length through a stored database (50) containing a tree stmc- 
ture of nodes (110) and pdmers (105). nodes with pointers being parent nodes and targets of pointers being child 
nodes, pointers to child nodes con-esponding to each of a number of possible values that can occur as fragments 
of a search argument, successive fragments of said search argument defining a search path following said pointers 
through said nodes, at least some of said nodes (110-11) identifying a result value (120). said apparatus eliminat- 
ing a removable node (110-7) that would otherwise occur between an immediately previous node (1 10-2) and an 
immediately following node (1 1 0-1 1) in said search path, the removable node having only one child node and either 
no result value or the same result value as its parent the apparatus comprising: 

a memory (220) storing information (125) describing conditions under which, had said eliminated node been 
present, the search would have proceeded to said following node, 
a controller (40) for following said search path through said nodes, 

a comparator (300) for determining whether said search argument meets the conditions of said stored informa- 
tion (125) when said controller reaches said previous node (1 10-2). wherein 

said controller causes said search to effectively progress from said previous node (1 10-2) directly to said fol- 
lowing node (110-11) H said comparator indicates that, had said eliminated node (110-7) been present, said 
search woidd have proceeded to said following node. 

12. The apparatus of dalm 11 , wherein 

the result value that would have been provided by said elinunated node is the result value that is provided by 
said following node. 

PatentansprQche 

1. Ein Verfahren zum DurchfOhren einer Suche reduzierter Lflnge durch eine gespeicherte Datenbank, die eine 
Baumstruktur von Knoten (110) und Zeigern (105) enthatt, wobei Knoten mit Zeigern Elternknoten (parent nodes) 
und Targets von Zeigern Kinderknoten (child nodes) sind, wobei Zeiger auf Kinderknoten jedem einer Anzahl mOg- 
licher Werte entsprechen, die als Rragmente eines Sucharguments auftreten konnen, wobei aufeinandertalgende 
Fragmente des Sucharguments einen den Zeigern durch die Knoten folgenden Suchweg definieren. woba minde- 
stens einige der Knoten (110-11) einen Ergebniswert (120) identifizieren, 

gekannzechnet durch Elimlnieren eines entfernbaren Knotens (1 10-7), der zwischen einem unmittelbar vorherge- 
henden (1 10-2) und einem unmittelbar folgenden (1 10-1 1) Knoten in dem Suchweg auftritt. wobei der entfernbar 
Knoten nur einen Kinderknoten und entweder keinen Ergebniswert Oder den gleichen Ergebniswert wie seine 
Eltern aufweist. 



EP0 408188B1 



wobei das Verfahren folgende maschinen-implementierte Schrrtte umfaBt: 

Speichern von Information (125), die Bedingungen beschreibt, unter denen - ware der eiiminierte Knoten vor- 
handen gewesen - die Suche zu dem fdgenden Knoten vorangeschritten ware. 
5 Folgen des Suchweges durch die Knoten, 

Bestimmen, ob das Suchargument die Bedingungen der gespeicherten Information (125) erfuilt, wenn die 
Suche den vorhergehenden Knoten (110-2) erreicfrt. und 

Veranlassen, daB die Suche wirksam von dem vorhergehenden Knoten (1 10-2) cfirekt zu dem (bigenden Kno- 
ten (1 10-1 1) voranschreitet. wenn der Vergleich angibt, daS - ware der eiiminierte Knoten (1 10-7) vorhanden 
10 gewesen • die Suche zu dem fblgenden Knoten vorangeschritten ware. 

2. Das Verfahren gemaB Anspruch 1 . wobei 

der Ergebniswert, der durch den eliminierten Knoten geliefert worden ware, gleich dem Ergebniswert ist, der 
IS durch den fblgenden Knoten geliefert wird. 

3. Das Verfahren gemaB Anspruch 1. wobei die Infbrnnation (125) in dem folgenden Knoten (1 10-1 1) gespeichert ist. 

4. Das Verfahren gemaB Anspruch 3. wobei 

20 

das Suchargument eine Reihe von Suchsegmenten umfaBt wobei jedes Segment des Arguments einem Kno- 
ten (1 10) entlang des Suchweges entspricht, 

mindestens ein einem gegebenen Knoten entsprechender Wert eines Segments mit einem Zeiger (105) in 
Beziehung steht der von dem gegebenen Knoten zu einem weiteren Knoten fOhrt, und einige weitere Werte 
2S des Segments mit einem terminierenden Modus der Suche in Beziehung stehen. 

5. Das Vertahren gemaB Anspruch 4. wobei 

die gespeicherte Information (125) ein gegebener Wert eines Suchsegmentes ist. der dem eliminierten Knoten 
30 entspricht wobei der gegebene Wert ein Wert ist, da- mit einem Zeiger, der von dem eliminierten Knoten (110- 

7} zu dem ftrigenden Knoten (1 10-1 1) fuhrt in Beziehung gestanden ware. 

6. Das Verfahren gemaB Anspruch 5, das ferner umtaBt: 

35 ein relatives VerknOpfen einer Oder mehrerer Satze von Indikatoren (1 15) mit einem oder mehreren Knoten 

(110), wobei jeder einem gegebenen Knoten zugeordneter Indikator in einem gegebenen Satz einen Wert 
eines Segments des Sucharguments angibt der dem gegebenen Knoten entspricht, der mit dem terminieren- 
den Modus in Beziehung steht. und wobei 

ein Folgen des Suchweges en Verarbeiten aufeinanderfolgender Suchsegmentedes Sucharguments umfaBt, 
40 wobei ein Verarbeiten eines gegebenen Suchsegments mit einem gegebenen Wert die folgenden Schritte 

umfaBt: 

ein Auslesen des Satzes von Indikatoren, die einem Knoten entlang des Suchweges zugeotdnet sind, der dem 
gegebenen Suchargument zugeordnet ist, und ein Bestimmen, ob ein Indikator in dem Satz angit>t daB der 
gegebene Wert mit dem terminierenden Modus in Beziehung steht, und 
45 ein Voranschreiten zu dem terminierenden Modus, wenn ein Indikator angibt daB der gegebene Wert mit dem 

terminierenden Modus in Beziehung steht. andernfalls 

ein Auslesen eines von dem gegebenen Knoten, der mit dem gegebenen Wert in Beziehung steht, fOhrenden 
Zeigers und ein Folgen des Zeigers zu einem nachfolgenden Knoten entlang des Suchweges. 

so 7. Das Verfehren gemaB Anspruch 6. wobei der terminierende Modus der Suche &n Terminieren der Suche umtaBt. 

8. Das Verfehren gemaB Anspruch 7. wobei 

das Suchargument eine Systemadresse in einem Netzwerk umfaBt. und 
55 die Ergebniswerte eine Leitweginformation zum Clbertragen von Paketen durch das Netzwerk an eine duich 

die Adresse bestimmte Vbrrichtung umteBt. 



9. Ein Verfahren zum Zuordnen einer Leitweginformation zu einer Paketadresse in einem verteilten digitalen Oaten- 
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vetarbeitungsnelzwefk, das die folgenden maschinenimplementierten Schritten umfaBt 

Speichern einer Datenbank-Baumstruktur von Nicht-Null-Knoten (110) und Zeigern (105), die jeweils von 
einem der Knoten zu einem anderen der Knoten fuhren, wobei Knoten mit Zeigern ElternKnoten und Targets 
von Zeigern Kinderknoten sind. wobei mindestens einige der Knoten einen Ergebniswert identifizieren, 
wobei ein Weg durcti die Knoten. der den Zeigern Iblgt, durch Werte entsprechender jeweiliger Segmente der 
Adresse identifiziert wird. wobei jeder Wert eines Segments einen unterschiedlichen Zeiger zu einem nachfol- 
genden Knoten entiang dieses einen Wegs identifiziert 
Speichern von Null-Knoten (110-10, 110-17), die die Wege beenden, 

Speichern, als de Ergebniswerte, von Leitwegirrfbrmation in einem oder mehreren der Knoten, wobei die in 
einem Knoten gespeicherte Leitweginformation mit einer beliebigen Paketadresse verKnOpft ist die einen 
durch den Knoten fuhrenden Weg identifiziert, gekennzeichnet durch: 

Ersetzen eines gegebenen Knotens (110-7). der nur einen von dem gegebenen Knoten (110-2) zu einem 
nachfolgenden Nicht-Null-Knoten (1 10-11) fQhrenden Zeiger aufweist und entweder keinen Ergebniswert oder 
den gleichen Ergebniswert wie sein Etternknoten aufweist durch einen Weg-String (125), wobei der Weg- 
Strlng den Wert eines dem gegebenen Knoten entsprechenden Segmentes speichert. der den Zeiger zu 
einem nachfolgenden Nicht-Null-Knoten identifiziert. 

1 0. Das Vertahren gemaS Anspruch 9, das ferner den folgenden Schritt umfaBt 

ein Ersetzen weiterer Knoten, die nur einen jeweils von den anderen Knoten zu einem nachfolgenden Nicht- 
Null-Knoten fOhrenden Zeiger aufweisen, bis keine solchen Knoten in der Datent»ank vertteiben. 

11. Vorrichtung (20) zum DurchfOhren einer Suche reduzierter Unge durch eine gespeicherte Datenbank (50). die 
eine Baumstruktur von Knoten (1 10) und Zeigern (10^ enthait, wobei Knoten mit Zeigern Elternknoten und Targets 
von Zeigern Kinderknoten sind, Zeiger zu Kinderknoten jedem einer Anzahl von mOglichen Werten entsprechen, 
die als Fragmente eines Sucharguments auftreten kOnnen, wobei aufeinanderfblgende Fragmente des Suchargu- 
ments einen den Zeigern durch die Knoten folgenden Suchweg definieren, wobei mindestens einige der Knoten 
(110-11) einen Ergebniswert (120) identifizieren, wobei die Vorrichtung einen entfernbaren Knoten (110-7) ellmi- 
niert, der andernfalls zwischen einem unmittelbar vorhergehenden Knoten (1 10-2) und einem unmittelbar folgen- 
den Knoten (110-1 1) in dem Suchweg auftreten wuide, wobei der entfernbare Knoten nur einen Kinderknoten und 
entweder keinen Ergebniswert Oder den gleichen Ergebniswert wie seine EHern aufweist mit folgenden Merkma- 
len: 

einem Speicher (220), der Information (125) speichert, die Bedingungen beschreibt, unter denen - ware der 
eliminierte Knoten vortianden gewesen - die Suche zu dem folgenden Knoten vorangeschritten w&re. 
einer Steuereinheit (40) zum Fblgen des Suchweges durch die Knoten, 

einem Komparator (300) zum Bestimmen. ob das Suchargument die Bedingungen der gespeicherten Informa- 
tion (125) erfOllt, wenn die Steuereinheit den vorhergehenden Knoten (110-2) erreicht, wobei 
die Steuereinheit veranlaBt daB die Suche wirksam von dem vorhergehenden Knoten (110-2) direkt zu dem 
folgenden Knoten (110-11) voranschreltet, wenn der Komparator anzeigt, daB - ware der eliminierte Knoten 
(110-7) vorhanden gewesen - die Suche zu dem folgenden Knoten vorangeschritten ware. 

12. Die Vorrichtung gemaS Anspruch 11. wobei 

der Ergebniswert, der durch den eliminierten Knoten geliefert word en ware, der Ergebniswert ist der durch 
den folgenden Knoten geliefert wird. 

Revendications 

1 . ProcMd pour conduire une recherche de longueur r6duite k travers une base de donn6es m6nxjrls6e contenant 
une structure en arbre de noeuds (110) et de pointeurs (105). les noeuds avec des pointeurs 6tant des noeuds 
p6res et les cibles de pointeurs 6tant des noeuds fits, les pointeurs aux noeuds f lis correspondant k chacune d'un 
certain nombre de valeurs possftjies pouvant apparaltre en tant que fragments d'un argument de recherche, les 
fragments successifs dudit argument de recherche ddfinissant un trajet de recherche suivant lesdits pointeurs & 
travers lesdits noeuds, au moins certains desdits noeuds (1 10-1 1) identifiant une valeur de r6sultat (120), 
caract6ris6 par la suppression d'un noeud 6liminable (1 10-7) pouvant apparaltre entre un noeud imm6diatement 
prteMant (110-2) et un noeud inrvnMiatement suivant (1 10-1 1) sur ledit trajet de recherche, le noeud 6liminEd)le 
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ayant seulement un noaid fOs et. soil aucune valeur de r6sultat, soit la mSme valeur de r^ltat que son pdre, 
le procM6 oomprenant les Stapes mises en oeuvre par machine et qui consistent : 

k m^moriser des informations (125) dto^ivant les conditions dans lesquelles, si ledit noeud &\rmn6 avait 6t6 
5 present la recherche se serait poursuivie audit noeud suivarrt. 

k suivre ledit trajet de recherche au travers desdits noeuds, 

lorsque ladite recherche attaint ledit noeud prteMent (1 1 0-2), la d^ermination quant k savoir si ledit argument 
de recherche satisfait ou non aux conditions desdites informations m^morlsdes (125). et 
k amener ladite recherche k progresser efficacement dudit noeud pr6c6dent (110-2) directement audit noeud 
10 suivant (1 10-1 1) si la comparaison indique que, si ledit noeud 6\imm6 (1 10-7) avart 6t6 prfeent. ladite recher- 

che se serait poursuivie audt noeud suivant 

2. ProcM6 selon la revendcation 1 , dans lequet 

IS la valeur de rteultat qui aurait 616 d6livr6e par ledit noeud 6limin6 est ^e k la valeur de rteuKat qui est d^n- 

' vr6e par ledit noeud suivant. 

3. ProcMd selon la revendication 1 , dans lequel lesdites informations (1 25) sont m6moris6es dans ledrt noeud sui- 
vant (110-11). 

so 

4. ProcM6 selon la revendication 3. dans lequel 

ledit argument de recherche comprend une s^ie de segments de recherche, cheque segment dudit argument 
con-espondant k un noeud (1 1 0) le long dudit trajet de recherche, 
25 au moins une valeur d'un segment correspondant k un noeud donnd est Ii6e k un pointeur (105) oonduisant 

dudit noeud donn6 k un autre noeud. et certaines autres valeurs dudit segment sont Ii6es k un mode de fin de 
ladite recherche. 

5. ProcM6 selon la revendication 4, dans lequel 

30 

lesdites informations mdmorisdes (125) sont une valeur donnte d'un argument de recherche con-espondant 
audit noeud 6limin6, 

ladite valeur donn^e ^nt une valeur qui aurait 6(6 \\6e k un pointeur conduisant dudit noeud ditmni (1 10-7) 
audit noeud suivant (110-1 1). 

35 

6. ProcM6 selon la revendication 5, comprenant, en outre 

rassoctation d'un ou de plusieurs ensembles d'indicateurs (115) relativement d un ou plusieurs noeuds (1 10), 
chaque Indlcateur d'un ensennble donn6 associ6 k un noeud donn6 indlquant une valeur d un segment dudit 

40 argument de recherche con-espondant audit noeud donn6 qui est li^e audit mode de fin, et dans lequel 

le suivi dudit trajet de recherche comprend le trattement des segments de recherche successifs dudit argu- 
ment de recherche, le traltement d'un segment de recherche donn^ ayant une valeur donn^e incluant 
I'extraction de I'ensemble des Indicateurs assodte k un noeud le long dudit trajet de recherche qui est associS 
audit segment de recherche donnk, et la determination quant k savoir si un indlcateur dudt ensemt)le indique 

45 que ladite valeur donn6e est lite audit mode de fin, et 

le passage audit mode de fin si un indlcateur indique que ladite valeur donn6e est lite audit mode de fin, sinon 
I'extraction d'un pointeur conduisant dudit noeud donn6 qui est 116 k ladite valeur donnte, et le suivi dudit poin- 
teur k un noeud suivarrt le long dudit trajet de recherche. 

50 7. Proc&16 selon la reirendication 6. dans lequel ledit mode de fin de ladite recherche comprend la fin de ladite 
rechercha 

8. ProcM6 selon la revendication 7, dans lequel 

55 ledit argument de recherche comprend une adresse de systdme dans un rteeau, et 

lesdites valeurs de rteultat oomprennent les informations de routage pour transmettre des paquets au travers 
dudit rteeau jusqu'd un dispositif dteignd par ladite adresse. 
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9. ProcMd pour associer des informations de routage kune adresse de paquet dans un r6seau de traitement de don- 
n^es num^riques distribu6. comprenant les 6tapes mises en oeuvre par machine et qui consistent : 

k mSmoriser une structure en arbre de base de donn6es de noeuds non nuls (11 0) et de pointeurs (105) con- 
5 duisant respectivement de I'un desdits noeuds k un autre descEts noeuds, les noeuds avec des pointeurs 6tant 

des noeuds p^res et les cibles des pointeurs 6tant des noeuds fils, au moins certains desdits noeuds identi- 
fiant une valeur de rteultat, 

un trajet au travers desdits noeuds suivant lesdits pointeurs qui sont identifies par des valeurs des segments 
respectifs correspondants de ladite adresse, chaque valeur d'un segment identifiant un different pointeur ft un 
10 noeud suivant le long d'un dit trajet. 

d m^oriser des noeuds nuls (1 10-10, 1 10-17) terminant lesdits Irajets, 

k m6moriser, en tant que lesdites valeurs de rdsultat, les informations de routage dans un ou plusieurs desdits 
noeuds, les informations de routage m6moris6es dans un noeud 6tant associ^ & une queiconque adresse 
de paquet qui identifle un trajet passant par ledit noeud, et caract6ris6 par : 
15 le remplacement d'un noeud donn^ (1 10-7) ayant un seut pointeur conduisant dudit noeud donn6 (110-2) k un 

noeud suivant non nul (1 10-11) et ayant soH aucune valeur de r^sultat, soit la mfime valeur de r^ltat que 
son noeud pkre, par une chaTne de trajet (125), ladite chaTne de trajet m^morisant la valeur d'un segment cor- 
respondent au noeud domd qui identif ie le pointeur audit un noeud suivant non nul. 

20 10. Proc6d6 selon la revendication 9, conprenant, en outre 

le remplacement des autres noeuds ayant un seul pointeur conduisant respectivement desdits autres noeuds 
k un noeud suivant non nul, jusqu'ft ce qu'aucun de ces noeuds ne demeure dans ladite base de donn^es. 

25 11. Apparell (20) pour conduire une recherche d'une longueur rMuite au travers d'une base de donn^es (50) corrte- 
nant une structure en arbre de noeuds (11 0) et de pointeurs (1 05). les noeuds avec des pointeurs 6tant des noeuds 
pdres et les cibles de pointeurs 6tant des noeuds fils. les pointeurs aux noeuds fils oon'espondant k chacune d'un 
certain nombre de valeurs possibles pouvant apparattre en tant que fragments d'un argument de recherche, des 
fragments successifs dudit argument de recherche ddfinissant un trajet de recherche suivant lesdits pointeurs au 

30 travers desdits noeuds, au moins certains desdits noeuds (1 10-1 1) identifiant une valeur de rteuttat (120), ledit 
appareil supprimant un noeud dlinrnnable (1 10-7) qui. sinon. apparaltrait entre un noeud immMiatement prteMant 
(1 10-2) et un noeud immMiatement suivant (1 10-1 1) sur ledit trajet de recherche, le noeud 6liminable ayant seu- 
lement un noeud fils et soit aucune valeur de rdsuHat, soit la mSme valeur de rteuHat que son pftre. I'appareil com- 
prenant : 

35 

une m^moire (220) mSmorisarrt des informations (1 25) d6aivant les corxlitions dans iesquelles. si ledit noeud 
supprim6 avalt &6 present, la recherche se serait poursuivie audit noeud suivant. 
un contrfileur (40) pour suivre ledit trajet de recherche au travers desdits noeuds, 

un comparateur (300) pour determiner si ledit argument de recherche satisfait ou non aux conditions desdites 
40 informations m6moris6es (1 25) lorsque ledit contrfileur atteint ledit noeud pr6c6dent (1 1 0-2). dans lequel 

ledit contrfileur amdne ladite recherche k progresser efficacement dudit noeud prdcMent (1 10-2) directement 
audit noeud suivant (1 1 0-1 1 ) si ledit comparateur indique que. si ledit noeud dlimind ( 1 1 0-7) avait iHk present, 
ladite recherche se serait poursuivie audit noeud suivant. 

45 12. Appareil selon la revendication 1 1 . dans lequel 

la valeur de r^sultat qui aurait &.6 deiivr^e par ledit noeud ^limine est la valeur de r^sultat qui est d6livr6e par 
ledit noeud suivant. 
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from Node 
00-01 
02-Of 
10-1f 



Contains 



Node control work (16 bits) (Fast Static RAM). 
Nothing * 

String storage space; 16 semi-octets, one semi- 
octet per octet of storage (String DRAIVI). 
(Nodes 1000-7ftf and 9000-ffff only). 

Additional string storage space; 32 more serrii- 

octets. one per octet of storage (String DRAM). 
(Nodes 7800-7fH and fSOO-ffff only). 

Sixteen "next node" pointers (Node Pointer DRAfW). 
(Nodes 8000 - tfff only). ** 

DSP-transition pointer (DSP Pointer SRAIWI). 
(Nodes 8000 - tfff only). 

Level-1 transition pointer (same physical pointer 
is accessible at nodes 8000 - ftff only ). 



Nothing * 

Storage space for key to Ije searched and result of 
search (eight contexts supported, one each at 
nodes 8000 through 8007 only). 

Alias address of above (see note below). 



co-ff Control and status registers (nodes 0 through 7 can 

bes used to access one register set). 

NOTE: Resources at addresses 00 -7f can only be accessed when the ARE 
is in maintenance mode. 

. • "Nothing" indicates that read data is undefined, write data is ignored. 
Fields defined for only some "N" nodes also store "Nothing in the remaining nodes. 

** The "next node" pointers are those that will be accessed (during a 
search) according to the current digit. A digit of "0' will access the pointer at 
address offset 40; a digit of "1" will access the pointer at address offset 42; etc. 
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